


What is a Romance language? How is one Romance language related 
to others? How did they all evolve? And what can they tell us about 
language in general? In this comprehensive survey Rebecca Posner, a 
distinguished Romance specialist, examines this group of languages 
from a wide variety of perspectives. Her analysis combines philolog¬ 
ical expertise with insights drawn from modern theoretical linguistics, 
both synchronic and diachronic. She relates linguistic features to his¬ 
torical and sociological factors, and teases out those elements which 
can be attributed to divergence from a common source and those 
which indicate convergence towards a common aim. Her discussion is 
extensively illustrated with new and original data, and an up-to-date 
and comprehensive bibliography is included. This volume will be an 
invaluable and authoritative guide for students and specialists alike. 
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PREFACE 


This is an extraordinarily difficult book to fit into the Language Surveys 
series, which was originally intended to provide a means for those who are 
more or less proficient in linguistics to have access to knowledge about 
languages they are not familiar with. The volumes in the series that have 
so far appeared differ considerably among themselves, sometimes con¬ 
cerned with diverse languages within one geographical area, sometimes 
with genetically related languages, sometimes - like the two volumes on 
Pidgins and Creoles - with languages grouped together because of their 
social function and historical development. They are perhaps most suc¬ 
cessful when they provide the sole authoritative account of the languages 
they describe. The types of exposition adopted are as diverse as the sub¬ 
ject matter. Some examine each language or sub-group of languages sep¬ 
arately, some treat chosen themes by adducing comparative data; some 
adopt a strictly synchronic approach, while others introduce diachronic 
considerations. 

One problem with writing about the Romance languages is that so much 
has already been written on them: I shall survey the history of Romance lin¬ 
guistics in my introductory chapter. Elementary introductions for the lay¬ 
man abound - my own paperback written some thirty years ago (Posner 
1966) is apparently still going strong. ‘State of the art’ surveys are also read¬ 
ily available - the one in four volumes I edited (1980-4) with my colleague 
John Green (Posner & Green eds.) was in 1993 supplemented by a fifth vol¬ 
ume, perhaps not the last. Comprehensive reference works are also readily 
available, the latest being the massive project from Niemeyer (Holtus, 
Metzeltin & Schmitt 1988-) that is still in the process of publication. 

The task of keeping this book down to a single volume has meant that 
much interesting material has had to be omitted or merely touched on. 
For phonology and morphology especially reference will still have to be 
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made to the comprehensive manuals, some of which date back more than 
a hundred years. Specialist monographs have since then flooded into 
libraries and now seem to be increasing in number, especially on syntactic 
and lexical questions. Adherents of virtually every school of linguistics 
have used Romance data to validate their ideas. Generativists’ interest in 
Romance largely postdates my comments in Iordan, Orr & Posner (1970), 
when there were comparatively few Romanist generativists. The 
‘Principles-and-Parameters’ model, in particular, has attracted linguists 
with Romance mother tongues, who initially were led to contrast their 
own languages with English and so to recognize that the Romance lan¬ 
guages have much in common. A result is that examples from the 
Romance languages are now commonplace in the works of non- 
Romanists. 

Yet, in preparing the book I have often been frustrated by the lack of 
solid information in many problematic areas of Romance linguistics. The 
abundant studies on the standard languages are often slanted towards the 
theoretical predilections and social prejudices of their authors; the body 
of material on non-standard languages is huge but patchy, both in geo¬ 
graphical and social coverage and in exemplification. Transcriptions of 
texts are inconsistent and sometimes uninterpretable. I have tried to tran¬ 
scribe as accurately as I can into the IRA alphabet whenever the phonetic 
detail is relevant to the argument. In other cases, though, more tradition¬ 
al orthography suffices for the purposes of exposition. Similarly, 
morph-by-morph translation sometimes seems helpful, if laborious, but 
more often I have made do with general indication of the sense. 

In citing examples (which are usually fairly randomly chosen samples 
of the material available) I have tried to avoid tendentious classification of 
‘languages’ and ‘dialects’, using, as far as is feasible, geographical nomen¬ 
clature. Thus, for instance an Italian dialect (Ital. dial.) is a non-standard 
variety spoken in Italy, and where relevant the locality in which it has been 
attested is cited. 

Judgements about usage are sometimes unreliable, not only because of 
the inherent difficulties of collecting such material, but also because of 
the ideological viewpoint of some of the speakers and investigators. I have 
frequently found that, when I had expected merely to summarize the work 
of others, I have had to embark on my own detailed examination of a par¬ 
ticular topic. Much work remains to be done, and some Romance vari¬ 
eties are near to extinction, so there is a degree of urgency. All too often. 
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though, new work that promises to be exciting turns out to be little more 
than a reformulation of findings that were already well known to readers 
of older works. It is tempting to cite works hot off the press, but it is dif¬ 
ficult coolly to assess their lasting worth until the novelty has worn off. 

I have had, of course, to rely very much on others, not only in their pub¬ 
lished work, but also for comments on, and correetions of, my own drafts 
and for the sight of unpublished fieldwork, lecture notes and the like. 
There is no way in which I could have first-hand knowledge of all cur¬ 
rently used Romance varieties: although quite widely travelled, I could 
not possibly even have visited every locality within the Romance-speaking 
world. Similarly, I cannot be personally familiar with the vast amount of 
published historical documentation, including high-quality literary 
works, in the languages, let alone the yet-untapped resources that still lie 
untouched in archives and attics. Most recent works on Romance (like 
Posner & Green 1980-93, Harris & Vincent 1988, Holtus, Metzeltin & 
Schmitt 1988- ) are collaborative enterprises, accepting that no one indi¬ 
vidual can have sufficient expertise to cover all the field. The result must 
be, however tough the editorial control, a certain lack of cohesion and 
consistency. 

The problem of which works to cite in the bibliography was daunting. 
I therefore decided to rely heavily on reference to previous surveys and 
bibliographies, rather than to attempt a comprehensive coverage, which in 
itself would fill up several massive tomes. I have for each chapter given an 
indication of further reading, but this can be only a preliminary guide, 
which the serious student would have to follow through further. 
Regretfully, I have had normally to limit my references to monograph- 
length works published during the 1980s and 1990s. Important periodical 
articles will normally be cited in these works and in the bibliographical 
surveys. 

To undertake my task required a greal deal of hubris: mine has threat¬ 
ened to desert me several times during the preparation of the book. What 
I have tried to do, however unsuccessfully, is something a little different 
from what has been done before, aiming at a theoretical coverage of top¬ 
ics that I believe to be of interest to all linguists, combining both syn¬ 
chronic and diachronic material, drawn from a wide variety of Romance 
varieties, and not merely standard languages. I have chosen not to give a 
language-by-language account, trying to treat every topic with a wide 
range of comparative material, both ancient and modern. Some friends 



XIV 


Preface 


and colleagues have urged more exemplification from more languages, 
others have suggested that I indulge too much in ‘philological’ detail. 
Because I believe that there will be a body of readers who take pleasure in 
the actual examples, as well as those who have a more theoretical interest 
in language, I have tried to illustrate from actual language material as 
much as possible, avoiding, as far as I can, the standard examples found 
in most handbooks. 

Some of my consultants complain that I often am hesitant in my judge¬ 
ments, over-using such hedges as ‘perhaps’, ‘apparently’, ‘allegedly’ and 
so on. I do not apologize for this: even in those areas in which I count 
myself an expert, I have frequently found that new material has radieally 
ehanged my view of linguistic reality. My ignorance remains almost bot¬ 
tomless, even though I have spent my life in the study of Romance lan¬ 
guages. I am very conscious that anything I say too firmly may very soon 
be demonstrated to be nonsense. I should like to think that this book will 
not be outdated before it reaches the shelves of its readers: I therefore 
leave some questions open for further research, while presenting my own 
current views. I have nevertheless tried to present a reasonably coherent 
case for writing a book on ‘The Romance Languages’ as a unit, rather 
than on each individual language. I know how assailable my stand is, but 
I am prepared to defend it against anticipated attacks. 

It remains for me to thank all the people who have given me advice and 
help while I have been preparing this book - some of them at considerable 
cost of time and patience. Most of all I wish to thank Roger Lass for read¬ 
ing a draft of the whole book and giving me the benefit of his inspiring 
insights from the point of view of a Germanist and theorist, especially in 
phonology and historical linguistics. Many others have read part of the 
book and supplied material: in particular I should like to mention John 
Green, Valerio Lucchesi, Yakov Malkiel, Ed Tuttle, Max Wheeler, Roger 
Wright, as well as many of my pupils who have suffered, though not 
silently, imposition of my views. I should also like to thank Judith Ayling 
at the Press for her patient help. My thanks also to Christopher Posner for 
preparing the Indices, and to Michael Posner for his patience and for 
giving me the time required to complete this task. Needless to say, I have 
only myself to blame for errors and shortcomings: I only hope that these 
are not so numerous that they spoil the reader’s enjoyment. 


Rebecca Posner 
Oxford 



LANGUAGE NAMES 


For convenience language data are cited according to geographical area; for more 

detailed information see chapter 5. The following abbreviations are used. 

dial, a non-standard regional variety within a geographical area (‘dialect’). 

ns.: non-standard usage that is not necessarily regionally based. 

Otherwise reference is made, where appropriate, to a standard variety, using the 

following general classification. 

Cat. (Catalan): the Barcelona standard, but also used to refer to varieties spoken 
in Catalonia, Roussillon (France), the Balearics, Valencia and 
Alghero (Sardinia). 

Dalmatian: the Romance language once spoken along the Dalmatian coast (for¬ 
mer Yugoslavia). The variety for which we have most evidence is from 
the island of Veglia (Krk), off the Istrian peninsula. 

Francoprovenpal: varieties used in the south-east of the French-speaking area, 
including part of Switzerland and the Val d’Aosta in Italy (see chap¬ 
ter 5). 

French (includes non-standard varieties of northern France, usually cited accord¬ 
ing to locality). 

Galician: spoken in north-west Spain, closely related to Portuguese. 

Ital. (Italian): the standard and also regional varieties (dialects) of Italy. 

Occ. (Occitan): used for the standard and non-standard varieties spoken in south¬ 
ern France, cited according to locality. 

Ptg. (Portuguese): used for the European and Brazilian standards, and for non¬ 
standard varieties, cited according to locality. 

R-R (Rhaeto-Romance): a cover term for some Romance varieties spoken in 
Switzerland (Grisons) and northern Italy (Dolomites and Friuli). 
Forms are cited according to locality (see chapter 5). 

Rum. (Rumanian): Daco-Rumanian is the standard language of Romania; for 
Arumanian, Megleno-Rumanian and Istro-Rumanian see chapter 5. 
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Sard: varieties spoken in central and southern Sardinia. 

Span. (Spanish): used for European and Latin-American standards, and for non¬ 
standard varieties cited according to locality. 


NOTES ON CONVENTIONS USED 

Phonetic transcription is in the IPA alphabet. Foreign citations in traditional 
orthography are italicized. Latin words are cited in small capitals; except where 
inappropriate the traditional convention is followed of citation of nouns in the 
accusative form without a final consonant (e.g. lupu for lupus, lupum, etc.). 
Terminological keywords are in boldface, usually at their first appearance. 



ABBREVIATIONS 


acc. 

accusative case 

C 

consonant 

dat. 

dative case 

f. 

feminine gender 

gen. 

genitive case 

m. 

masculine gender 

n. 

neuter gender 

nom. 

nominative case 

p.1-6 

verb persons 1-6 (1st, 2nd, 3rd sg./pl.) 

pi. 

plural 

pres. 

present tense 

pret. 

preterite 

sg. 

singular 

subj. 

subjunctive 

SVO etc. 

subject-verb-object etc. word order. 

V 

vowel 

V1/V2 

verb-first/verb-second word order 
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Introduction 


WHY THE ROMANCE LANGUAGES? 

Why write about the Romance languages? Perhaps just because, like 
Everest, ‘they are there’. But are they - as an entity, that is? Is there any¬ 
thing about the Romance languages that makes them susceptible to treat¬ 
ment as a single unit? That is a question I shall be addressing all through 
this book, and to which I believe the answer is ‘Yes!’ Others, however, deny 
that synchronically they have more in common than any arbitrarily cho¬ 
sen collection of languages. The only reason why we treat them together 
is, then, because they are related historically, and are all derived from 
Latin, or, conceivably, a near-relative to Latin. In this view the features 
shared by the Romance languages would form the common heritage, or be 
a consequence of borrowing, or result from a common process of ‘drift’, 
or be merely coincidental (or any combination of these). 

This must, surely, be true. What I would contest is the next step in the 
argument; that synchronic comparison of the languages would yield no 
linguistically interesting insights, and that only a historical approach is 
possible or desirable. I shall maintain that, although there are many differ¬ 
ences of detail between the Romance languages that cannot be accounted 
for except in terms of history (social as well as linguistic), there are many 
cases in which description of the synchronic state of one language loses an 
important dimension if no comparison is made with the other languages. 

This is so not only of the standard languages, which have both influ¬ 
enced each other and recoiled from each other, but also of the non¬ 
standard languages, which often preserve, or innovate, parallel features 
that are spurned by their prestigious counterparts. Thus, what appears a 
puzzling anomaly in one language may, when it appears in another, 
become intelligible because the conditioning is more transparent. 
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Introduction 


Whether there is such a thing as a ‘Romance type’ - a guiding principle 
which governs how the language mechanism ticks - is something I shall 
discuss at length. How this may relate to a postulated ‘Eurotype’, which 
would presumably owe much to the common history and aspirations of 
European peoples, is also worth considering. 

Of one thing I am certain: it is misguided to attempt to account for 
every feature of the Romance languages without recourse to historical 
explanation. On the other hand, some mechanisms in the languages can 
be illuminatingly described in synchronic terms, by concentrating on the 
often minute ways in which the mechanism differs from language to lan¬ 
guage, and taking for granted that in a broad sense it works in much the 
same way in all. 

Nevertheless it is still true that any comparative study of the Romance 
languages will cast most light on the workings of linguistic history. In a 
sense, as linguistic geographers were quick to notice in the past, historical 
development of the language can be mirrored by the geographical distri¬ 
bution of variants, with some regions appearing to be more innovative, 
whereas others resist change. Thus with synchronic variation it is possible 
to factor out the time parameter of historical linguistics, substituting for 
it geographical or social space. 

Time, however, must come back into the picture when we are faced with 
the introduction or the loss of a variant in any one language: here we shall 
be concerned primarily with changes in the socially acceptable ‘norm’ of 
a speech community, and only secondarily with system or type changes. 
How far changes in the norm feed into the language acquisition process 
(perhaps via a change in parameter setting) is a burning question. 

History of Romance comparative grammar 

Most comparative studies of Romance have, until very recently, been his¬ 
torical in their orientation. Romance philology is often said to have start¬ 
ed with Dante’s unfinished Latin treatise De vulgari eloquentia (written 
after 1304), in which he divides the Romance languages into three groups, 
according to their word for ‘yes’, si, oc, oui, contrasting them with other 
European languages. It is far from clear, however, whether Dante had a 
view on their relationship with each other and with Latin. His aim was to 
devise an ‘illustrious vulgar tongue’ that would rival Latin as the language 
of serious discourse, and in so doing he disparaged the claims of contem- 
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porary spoken Italian varieties. It is likely that in the early Middle Ages 
the Romance languages had been viewed as fluctuating spoken versions of 
Latin, which remained the main vehicle for writing. Separate names for 
the vernaculars, based on their geographical spread, began to be used reg¬ 
ularly only from the late twelfth century (see 3.3), and there is evidence of 
multilingual usage within the Romance area, not only between Latin and 
vernacular, but also between the different vernaculars. 

It was only later, from the sixteenth century on, when vernaculars were 
ready to take over completely from Latin, that the question of the 
Romance-Latin affiliation was hotly debated: some even claimed that 
Romans had always used Italian varieties in colloquial speech and that the 
Latin of texts had never been other than a literary convention. The other 
Romance languages would in this view reflect adulteration from the bar¬ 
barian tongues with which they were in contact from the time of Roman 
colonization (see 6.4). For the French their language stood somewhat 
apart, as incorporating Germanic (Frankish) elements, or, even, as being 
derived from Greek or Celtic, rather than from Latin. 

During the eighteenth century, when the regularity of phonological 
change began to be perceived, treatises on the origins of Romance lan¬ 
guages began to appear in profusion, but the title of ‘father’ of Romance 
philology is usually bestowed on Fran^ois-Juste Raynouard (1761-1836), 
writing in the post-Napoleonic period. He discerned in the Romance lan¬ 
guages common features that could not all be ascribed to Latin heritage, 
and therefore postulated a common ancestor, la langue romane, that he 
believed replaced Latin before AD 1000, and which most nearly resem¬ 
bled the language of the troubadours (now called Old Occitan, or Old 
Proven 9 al). 

Raynouard’s hypothesis received short shrift from the more profession¬ 
al German comparative philologists, in particular August Wilhelm von 
Schlegel, who viewed Romance as the spontaneous outcome of the 
decomposition of Latin, after the disappearance of the normative pres¬ 
sures exerted by a stable cultural environment. 

The first systematic treatise on Romance was published in 1831 by 
Lorenz Diefenbach, who maintained that the Romance languages derived 
from a popular form of Latin, further adulterated by contact with other 
languages, especially in the case of French, which he characterized as 
‘very creolized’. Diefenbach’s fellow-student at Giessen, Friedrich Diez, 
was to become the first Professor of Romance Philology, at Bonn. His 
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Grammatik der romanischen Sprachen in three volumes, which went into 
three editions and two further reprints, began to appear in 1836. He too 
believed that popular Latin was at the origin of the Romance languages 
which developed by phonological processes and functional adjustments 
dictated by the ‘spirit’ of the people, originating from speakers’ negligence 
in the absence of a normative standard. For him Rumanian (‘Wallachian’) 
was only a semi-Romance language, and French was more influenced by 
German than by Celtic. 

The French translation of the Grammar, under the direction of his illustri¬ 
ous pupil Gaston Paris, appeared in 1869. 

For half a century Diez’s Grammar remained the Bible of Romance 
philology. Meanwhile numerous treatises on individual languages, espe¬ 
cially on medieval texts, and essays on the question of the origin of one of 
the languages, were appearing. We should mention especially the contri¬ 
butions of Hugo Schuchardt, who battled against the idea of sound-laws 
and traced the origin of Romance phonological features to archaic popu¬ 
lar (‘Vulgar’) Latin, and of Graziado Isaia Ascoli, whose work mainly 
concerned dialectal usage. 

After Diez’s death in 1876 a new wave of German Romance philologists 
came on the scene, associated with Neogrammarian views about linguis¬ 
tic change, as stemming from mechanical and exceptionless phonological 
shifts, limited in time and space. Gustav Grober, who became professor at 
the new University in Strasburg after the Prussian annexation of Alsace, 
had launched the tone-setting Zeitschrift fur romanische Philologie in 1877 
and in 1888 edited the first edition of the Grundrifi der romanischen 
Philologie, a ‘state-of-the-art’ survey of the discipline. 

Regularist and reconstructionist views were dominant, especially in the 
contribution of Wilhelm Meyer (later Meyer-Liibke), whose own 
Grammatik der romanischen Sprachen (1890-1902), translated immediate¬ 
ly into French, was to replace that of Diez, for phonology and morphol¬ 
ogy, at any rate, and to remain the standard work. For Meyer-Liibke the 
pivot of grammatical study was phonetics: change is a physiological 
process not controlled by meaning, which causes disintegration, which 
may then be patched up by analogical change, dictated by functional 
needs. He was able to improve on Diez, not only in phonological method¬ 
ology, but also in his coverage of language data, drawn from both literary 
languages and from dialect material. 
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On syntax, however, he is on less firm ground, hampered by a lack of 
methodology and by the multiplicity and heterogeneity of this material, 
mostly drawn from older texts. Comparative Romance syntax remained a 
stumbling-block for later scholars, many of whom hesitate to plunge into that 
murky pond, or refuse to isolate syntax, preferring to treat morphosyntax as 
a composite functional level. It is only recently that new ground has been bro¬ 
ken in the study of Romance syntax, both synchronic and diachronic. 

The Franco-Prussian war is held by some to have occasioned the often bit¬ 
ter divorce between French and German linguistic thought. Certainly, by 
the end of the nineteenth century the French were turning away from reg- 
ularist and reconstructionist Romance philology and concentrating more 
on dialectology and on sociological and pragmatic approaches to the 
individual languages. ‘Idealist’ linguistics, with its emphasis on the con¬ 
scious creativity of linguistic processes and on the communicative func¬ 
tions of language, became the hallmark of Romanist work within the 
Romance-speaking countries. The standard French work on Romance 
philology, Edouard Bourciez’s Elements de linguistique romane, which first 
appeared in 1910, considers each language separately and adopts a func¬ 
tionalist view of change as resulting from a tension between economy of 
effort and desire for effective communication. 

In spite of Saussure’s remarks, in the Introduction to his Cours de lin¬ 
guistique generale, on the importance of Romance philology, there were 
no important additions to comparative Romance linguistics between the 
world wars, except for the idealistically slanted methodological survey by 
lorgu Iordan, especially in its 1937 English version prepared by John Orr, 
Introduction to Romance linguistics: its schools and its scholars. New ver¬ 
sions of this influential work appeared in a variety of languages after the 
Second World War, when there was an upsurge in interest in comparative 
Romance studies. 

The most successful of the immediate postwar works was Carlo 
Tagliavini’s Le origini delle lingue neolatine: introduzione alia filologia roman- 
za, first appearing in 1949, which concentrated on the early stages of the lan¬ 
guages, juxtaposing rather than comparing them. Similarly juxtapositional 
of hterary languages, except for a ‘Common Romance’ chapter, is Walter 
Elcock’s popular The Romance languages, which first appeared in 1960. 

Heinrich Lausberg’s three-volume Romanische Sprachwissenschaft, 
beginning publication in 1956, is concerned with the genesis, rather than 
the evolution, of the languages. As a functionalist, influenced by the ide- 
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alist stance of Eugenio Coseriu, Lausberg sees language structure as in 
unstable equilibrium, ready to topple if undermined at its weak points, 
and insists on social factors and speakers’ consciousness of their language 
as a community possession. 

One of his innovations, stemming from his own expertise in dialectal 
morphophonology, is to distinguish the classic Vulgar Latin phonological 
system from the ‘archaic’ one of Sardinia, Lucania and Africa and the 
‘compromise’ one of Balkan Romance and Eastern Lucania. Another is 
his introduction of the concept of ‘mechanization’, a phenomenon that 
permits repetition and reproduction of a chance innovation, enhancing 
social intercourse by reducing the effort required to formulate expression. 
This is related, it seems, to what is often called grammaticalization, by 
which linguistic items are impoverished (‘bleached’) of lexical content and 
come to serve as grammatical items, and perhaps to morphologization, by 
which hitherto general phonological rules are restricted to marking cer¬ 
tain morphological functions. 

Whereas Lausberg’s work is representative of European structuralism, 
Bloomfieldian structuralism, in its comparativist guise, is exemplified by 
the work of the Cornell Romanists, Robert A. Hall Jr and Frederick B. 
Agard. Hall has long defended the regularist and reconstructionist stance 
in Romance linguistics, especially against ‘idealism’ and generative 
approaches: three volumes of his comparative work on the Romance lan¬ 
guages appeared between 1974 and 1983. Agard’s 1984 A course in 
Romance linguistics has both a diachronic and a synchronic volume: the 
latter provides a contrastive grammar of five standard languages (French, 
Spanish, Portuguese, Italian, Rumanian) using an eclectic grammatical 
model - a hierarchical phrase-structure grammar, with some relational 
concepts and simple transformations, not unlike traditional grammar. 

Maria Manoliu-Manea, a Californian Romanist of Rumanian origin, 
was perhaps the first systematically to essay the application of 
Chomskyan ‘standard theory’ to comparative Romance problems, in her 
1971 Gramatica comparatd al limbilor romanice. Her later work (1977; see 
Manoliu-Manea 1985) on Romance typology leans more towards case 
grammar and generative semantics, though grounded in European struc¬ 
turalism. 

Recent years have seen a great increase in research into comparative 
Romance linguistics. Even within the generative camp, where interest is 
focused on language universals and cognitive structures, rather than on 
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languages as such, Romance languages have become a centre of attention. 
Assuming that language is organized according to restrictive principles, it 
is suggested that nevertheless specific parameters of variation can account 
for some differences between languages. The language learner is thought 
to determine the value of these parameters through exposure to language 
data. 

The interest of Romance languages to the theorist is that they provide 
abundant material from systems that are closely similar but which differ 
interestingly in some features, which may be attributable to parametric dif¬ 
ferences. By minute examination of variation within Romance it is believed 
that parameters, which account for a cluster of seemingly unrelated differ¬ 
ences, can be pinpointed. Such Romance features as use or non-use of clitic 
pronouns, order of elements, the status of inflections or of auxiliaries, 
causative constructions and so on have been discussed from this angle. 

The range of language material used has widened to cover not only 
standard languages, but also regional dialects, so that our knowledge of 
some areas of Romance syntax has improved immensely as a result of 
these studies. On the diachronic plane, the question has also been 
addressed of how, over time, the parameter setting of a language can 
change: what are the characteristics of language input that persuade the 
language learner to decide that a different value should be attributed to 
the parameters of variation? 

Although there is as yet no general work on comparative Romance syntax 
within this framework, there are several that tackle specific questions within 
Romance: the names of Richard Kayne, Luigi Rizzi, Luigi Burzio, Maria 
Luisa Rivero, Oswaldo Jaeggli, Hagit Borer, Jean-Yves Pollock, Jean-Roger 
Vergnaud are among the many that can be mentioned. 


WHAT DO ROMANISTS WORRY ABOUT? 

It is, however, probably true that these generativists would not regard 
themselves primarily as Romanists, but as linguistic theorists who, often, 
are native speakers of a Romance language and for whom the Romance 
data provide a stimulus to refinement of conceptual argumentation. 

Many others who may indeed call themselves Romanists concentrate 
on one Romance language, or on the transition from Latin to Romance. 
They will usually have rather different interests from the Principles-and- 
Parameters brigade, in that they do not limit themselves to study of indi- 
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vidual manifestations of a hypothetical core grammar, but also, and per¬ 
haps principally, to what Chomskyans call the ‘periphery’ and regard as 
accidental-historical residue, chance idiosyncrasies, features arising from 
language contact, dialect mixture, social or pragmatic factors or con¬ 
scious seeking for stylistic effect - but what others regard as the fascinat¬ 
ing mysteries of language that await elucidation. Not all are convinced 
that one Romance language can cast light on another: in some cases 
national pride is at stake, in others the promotion of elegance in language 
use is more important than the investigation of causes. 

Comparative Romance linguistics as such is still, as in the past, mainly 
fostered outside the Romance-speaking area - especially in German¬ 
speaking countries - though Romania (Rumania) has a long and respect¬ 
ed tradition in this field. German interest in Romance may be a result of 
historical accident, as a spin-off from the nineteenth-century popularity 
of comparative philology, with roots in the search for a Germanic identi¬ 
ty, but in the postwar period it has gained momentum, embracing all 
manifestations of both modem and medieval Romance. 

(a) Classification 

Among the topics that most preoccupy comparative Romance linguists is 
the classic one of the relationship of Romance to Latin and of the Romance 
languages to one another. Classification of the languages is traditionally 
discussed in ‘family tree’ terms, with branching determined on the basis of 
common innovations - nearly always phonological - but several other 
schemata, including synchronic typologies, are used (see chapter 5). 

(b) Dialectology and sociolinguistics 

Of particular interest to Romanists are the social functions of language 
and the relationship of non-standard varieties (especially regional dialects) 
to standard languages, of spoken to written usage (see chapter 8). 

(c) Stylistics and pragmatics 

Given that several of the languages are vehicles for literary discourse of 
undisputed high quality, it is not surprising that Romanists are particu¬ 
larly concerned too with stylistic and pragmatic study. 
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(d) Phonology 

Phonology has been at the centre of historical Romance linguistics, cer¬ 
tainly since the Neogrammarian era, with particular emphasis laid on 
shared developments, like palatalization and diphthongization, as well as 
on processes that affected only some languages, like nasalization. From a 
strictly synchronic point of view there is no particular reason why pho¬ 
netics or phonology should be treated in a pan-Romance perspective, 
though certain topics that relate to morpheme structure, like accent place¬ 
ment and sandhi phenomena, can profitably be examined cross-linguisti- 
cally. However, some synchronic phonological treatments do mirror quite 
closely diachronic analyses, and past phonological processes not infre¬ 
quently can be viewed as ongoing in contemporary varieties. Morphology 
is frequently closely linked to phonology, especially through morpholo- 
gization processes, but many Romanists concentrate more on the semantic 
and grammatical functions of ‘morphosyntax’, as well as on grammati- 
calization of lexical elements. 

(e) Lexicon 

The lexicon has most attracted Romanist attention. This most idiosyn¬ 
cratic and least systematic aspect of language is also, by that very token, 
the most diagnostic for Romanceness, as the lexical similarity between the 
languages could not have arisen by accident or through common cognitive 
requirements. Some historical link must therefore be assumed - either 
common origin or close interaction. 

For Romance both conditions obtain and are well attested. The common 
origin or proto-language is obviously seen, on the basis of both linguistic 
and historical evidence, as a form of Latin; so tracing the trajectory of 
Latin words into Romance, or reconstructing the origin of Romance 
words, is an activity particularly beloved by Romanists, as is the unravel¬ 
ling of the history of borrowing and new creations. From this perspective, 
not only form but also meaning, and change of meaning, can be exam¬ 
ined, and linked with social and ideological changes in the speech com¬ 
munities. Word formation from native sources is also much studied, lying 
as it does between the idiosyncratic and the systematic levels of language, 
and allowing comparison of the mechanisms used by the different lan¬ 
guages, and of the functions they fill. 
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(f) Syntax 

As I have already pointed out, until recently Romance syntax was rarely 
studied, except in terms of semantics and style, partly because of the dif¬ 
ficulty of obtaining reliable comparative data, except from literary lan¬ 
guages which interact considerably, but also because methodology was 
insufficiently developed to permit plausible extrapolation from the multi¬ 
farious material. The ‘parametric’ approach has, however, given a new 
boost to this aspect of Romance linguistics, even among those who are 
doubtful about its validity. 

(g) Historiography 

A mention should be made of the interest shown by Romanists in the his¬ 
tory of the discipline and its relationship to social and ideological history. 
As we have already glimpsed, speculation about the origin of the lan¬ 
guages and analysis of their structures and social functions dates far back, 
even into the ‘prelinguistic’ era, and much conscious manipulation of lan¬ 
guage use as a social tool is evident in all the standard varieties. 

WHAT DO ROMANISTS ARGUE ABOUT? 

I have already hinted strongly that Romanists are far from unanimous 
about the way the languages should be studied. Conflict has often arisen 
between those who see the discipline as part of the humanities or of social 
studies and those who seek a more ‘scientific’ approach, between those 
who believe that historical factors must be taken into consideration and 
those who attempt to keep to a strictly synchronic line, and between those 
who view language as the property of a community and those for whom 
it is a part of individual cognitive makeup. Indeed, as in other linguistic 
enterprises, we not infrequently find a dialogue des sourds, in which inter¬ 
locutors follow their own line of thought completely oblivious of what 
others are saying. A consequence is that the wheel is constantly being re¬ 
invented - as the history of Romance linguistics demonstrates - while the 
metalanguage that describes its form and function undergoes frequent 
radical transformations. 

Classical Romance philology tended to concentrate on the chronologi¬ 
cally earlier stages of the languages, seeking to unravel the testimony of 
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medieval texts and to reconstruct the linguistic past. The interests of 
medievalists and students of modem Romance varieties are not always 
identical, though they can be complementary. The differences between stu¬ 
dents of dialects and other non-standard varieties and those who limit 
themselves to the literary languages, as between comparativists and experts 
in a single language, can be extreme. However, it seems that Romanists feel 
they have enough in common regularly to gather in congresses and sym¬ 
posia, seeing themselves as a coherent group within linguistics. 

WHAT CONTRIBUTION CAN ROMANCE 

STUDIES MAKE TO SYNCHRONIC AND 

DIACHRONIC LINGUISTICS? 

At one time Romance linguistics was seen as at the forefront of linguistics, 
especially of diachronic linguistics, because of its wealth of primary mate¬ 
rial in an unbroken chronological record. Later the very abundance of 
material, both primary and secondary, was to deter neophytes who would 
find richer pickings in less exploited linguistic seams. Romance philology 
came to be seen as a backwater, rather than part of the mainstream, espe¬ 
cially when synchronic linguistic theory became more highly esteemed 
than language history. The rehabilitation of diachronic, or panchronic, 
linguistics, and the use made of Romance data by linguistic theorists, has 
changed the picture. I shall give three examples of the contribution that 
Romance linguistics has made or is making to more general study of lan¬ 
guage - in the realms of lexical reconstruction, of morphosyntax, and of 
phonology. I shall go into more detail than I can in later discussion, in 
order to give some picture of the richness of the data and the complexity 
of the argumentation. Ancillary information not essential to the main 
thrust of the discussion will appear in small type. 

(a) Lexical reconstruction 

The greatest contribution of Romance studies must surely be to compar¬ 
ative philology reconstructionist techniques, as Romance is the only 
extended ‘family’ with a well-attested ‘mother’ (Ursprache or proto¬ 
language), so that reconstructed forms can be matched with attested 
equivalents. Take, for instance, the word for ‘sweet’, which has cognates in 
all the Romance languages: 
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(1) Rum. dulce, Dalmatian (Veglia) dolk, Ital. dolce, Sard (Logudorian) dulke, 

R-R (Engadine) due, (Friuli) dolts, French doux, Occ. dous. Cat. dolg. Span. 
dulce (Old duz), Ptg. doce. 

Most of the differences between these forms are regularly found in 
parallel correspondences, like the presence or absence of a final -e, of 
preconsonantal / (replaced by u in some of the graphical forms), or of the 
following consonant which may be a voiceless velar (as in Sard and 
Dalmatian), a palato-alveolar (as in Italian, Rumanian and Engadine 
Rhaeto-Romance), a dental sibilant or affricate (as in most other 
languages). The correspondence between o and u for the stressed vowel is 
also regular, except in the case of Spanish, where we should expect o. 
Classic reconstruction from the Romance forms would end up with 
/dulke/: the Latin word for ‘sweet’ is dulcem in the accusative, the case 
form which most often closely matches Romance forms (see 3.3). 

Latin final -m almost never is reflected in Romance, and we have ample evi¬ 
dence that it was not pronounced as a consonantal segment in later Latin. 


(1) Inheritance or borrowing? 

We can therefore be confident that our reconstruction matches exactly the 
phonemic structure of the form attested in texts. The slight discrepancy in 
Spanish is usually explained away as influenced by, or borrowed from, the 
learned Latin form. Sometimes it is hard to detect whether the Romance 
forms are inherited or borrowed from Latin, and we are usually suspicious 
when resemblances between Romance and Latin are too obvious. Take, 
for instance, the word for ‘sad’, which also has cognates in all the 
Romance languages: 

(2) Ital. trlste, tristo, Occ. tritz. Span., Ptg. triste. Rum. trist, Sard (Logudorian) 
tristu. Cat. trist, R-R (Engadine, Friuli) trist, French triste. 

The Latin accusative form is tristem, but some of the Romance forms 
require a reconstruction with final -u, which is attested in a valuable text, 
known as the Appendix Probi. 

(3) TRISTIS NON TRISTUS. 

A nuance of ‘morally reprehensible’ is sometimes attached to the -u ver¬ 
sions in Romance. 
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The modem French form could conceivably represent a feminine /trista/, 
which would have replaced the attested form trist (also tristre). In any case 
the preconsonantal s should regularly have disappeared in central French by 
the modem period (see 7.2): there is an Old French nominative form triz 
(/trits/) and the spelling trite is attested until the fourteenth century, while 
trtste can rhyme with ermlte, and tristre with traitre. So to explain the 
presence of -s in the modem French word, we have to postulate a spelling 
pronunciation probably from the sixteenth century (as in registre). 


(ii) Vowel length 

There is no doubt, however, about the reconstruction of /triist-/ with a 
long vowel, compared for instance with crIsta ‘coxcomb, crest’, which 
has the following reflexes in Romance, dictating the reconstruction of a 
short vowel: 

(4) Rum. creastd, Ital. cresta, R-R (Engadine) krasta, (Friuli) kreste, French Crete, 

Occ., Span., Cat. cresta. (Ptg. crista and Sard krista would allow reconstmc- 
tion of either a long or a short vowel.) 

The problem here is that we do not know whether Latin tristis had a 
long vowel ‘by nature’, as it represents a case of ‘hidden quantity’ - for 
metrical scansion purposes, as it occurs in a ‘heavy’ medial consonant- 
final syllable, it is counted as long ‘by position’ (see 3.3). The Romance 
forms seem to provide the main evidence for a long vowel in this word, 
although some Latinists mention, without citing, epigraphical evidence: 
the etymology of the word is obscure, and graphy provides no clue to the 
vowel quantity. Elsewhere in Latin an s+consonant cluster is usually 
preceded by a short vowel, but here influence from the near synonym 
MAESTUS (with a long diphthong) has been suggested. Some commen¬ 
tators prefer to discard the Romance evidence in this case, regarding the 
forms as early borrowings rather than as inherited. 

A similar niggling worry about vowel length spoils enjoyment of the 
anecdotally amusing history of Romance words for ‘liver’. The Latin word 
lECUR has not survived, but the following (near) cognates are representative 
of the normal words for the ‘liver’, found in a range of Romance languages: 

KvLva.ficat, R-R (Friuli) fiat, Sard (Campidanian) figau, (Logudorian) fikatu. 
Span, higado. Old French (Norman)^rie, French foie, Ital. fegato, Ital. dial. 
(Emilia, Lombardy) fidak, (Abbruzzo) fetteche, French dial. (Walloon) feute, 
Occ., Cat. fetge. 


( 5 ) 
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These words can be linked with the Greek word sykoton, which literally 
means ‘figged’ (Latin ficatus, attested from the fourth century). It refers 
to the habit of feeding animals, especially geese and pigs, with figs, to fat¬ 
ten the liver for culinary purposes. The Latin word for ‘fig’ is (fourth 
declension) ficus, the long vowel of which can be reconstructed from the 
Romance forms: 

(6) IX&l.fico, Old French fi. Span, higo, Vtg. figo 

as well as from the related feminine forms, like Occ. figa, from which 
French figue was borrowed in the twelfth century (masculine fi, fic sur¬ 
vives in dialects, meaning ‘wart’). There is also indirect evidence for a long 
vowel in Germanic borrowings, like German Feig, Dutch vijg. 

In the word for ‘liver’, however, the modern French and Italian forms 
point to a short Latin I (or perhaps a long e taken from a Latin pronunci¬ 
ation of Greek upsilon). Moreover, these forms, like those of Old French, 
Spanish and Logudorian Sard, are accented on the Latin first syllable, and 
not, as we would expect and as happens in Rumanian, Friulian Rhaeto- 
Romance and Campidanian Sard, on the penultimate. A Latin distortion 
of the oxytonic Greek word is blamed for this latter anomaly. 

To make things more complicated there must have been metathesis (giving 
*FITACUS, *FETACUS) to account for some Italian dialects (Emilia, Lombardy, 
Abruzzo), and Occitan and Catalan forms. 


(iii) Proto-Romance lexicon 

Although it is rare for there to be no snags in identifying the reconstruct¬ 
ed with the Latin forms, in the main the textual attestations confirm 
reconstruction techniques, and Romanists are justified to some extent in 
taking the short cut, ‘peeping in the back of the book’, by working down 
from the Latin form, rather than up from the Romance reflexes. 
Sometimes, however, reconstruction gives us information about the proto¬ 
language that the texts do not unambiguously provide. For instance, the 
word for ‘fire’ in Romance corresponds not to ignis but to focus ‘fire¬ 
place, hearth’: 

(7) Rum./oc, Dalmatian (Veglia) fuk, \ta\. fuoco, Sard (Logudorian) fogu, R-R 

(Engadine) fog, (Friuli) fug, French feu, Occ. fuec. Cat. foe. Span, fuego, Ptg. 
fogo. 



Introduction 


15 


Similarly a ‘cultivated field’ is not from ager, but from campus, ‘plain, 
level ground’ (especially a sports field): 

(8) Rum. cnnp, Ital., Span., Ptg. campo, Occ., Cat. camp, French champ. 

Occasionally texts provide us with confirmation of our phonological 
reconstructions. For instance, the word for ‘old’ is not vetus (which gives 
words for ‘rancid, musty’, like Ital. vieto), but apparently /veklu/, with 
palatalization of the medial cluster identical to that found in the words for 
‘eye’ /oklu/, in which there is amply attested syncope (see chapter 3) of the 
short atonic u of OCULUS: 

(9) Rum. vechi (the near synonym batrin < veteranus is a little less common) / 
ochi, Ital. vecchiolocchio, French vieilloeil. Span, viejotojo, Ptg. velhotolho. 
Cat. velUull, R-R (Engadine) veglldgi, (Friuli) vielilvdii (note also Dolomite 
\edllddl - here velars in such clusters regularly become dentals). 

Again the Appendix Probi provides us with a clue: 

(10) V&ULUS NON VECLUS 

(where & is an abbreviation for et) confirms our deduction that in the 
diminutive vetulus, syncopated to /vetlus/, the un-Latin cluster /tl/ was 
replaced by /kl/, which subsequently regularly palatalized in most of the 
languages. 

(b) Auxiliary selection 

A morphosyntactic question that has not ceased to arouse the curiosity of 
Romanists, who have thereby contributed to wider linguistic discussion, is 
that of ‘auxiliary selection’ in the Romance compound tenses with the 
past participle. Later (see 3.3) we shall return to the Romance compound 
perfect; here I look in some detail at only the auxiliary selection question, 
as it raises general questions in syntactic analysis. Though normally a 
HABERE (or TENERE) ‘have’ auxiliary is used in the compound tenses, 
sometimes an ESSE ‘be’ auxiliary is chosen. Some commentators have 
implausibly attributed the use of the esse auxiliary with some intransitive 
verbs to the influence of German, where the distribution of haben and 
sein auxiliaries is similar, but others have linked it to stative and passive 
uses of the esse auxiliary. Most regard the choice as determined lexically, 
with some degree of semantic cohesion, and some believe that it is revela¬ 
tory of a syntactic distinction. 
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(i) Which languages? 

Not all the modern Romance languages can choose a be auxiliary in the 
active perfect. Spanish and Portuguese ceased to do so by the seventeenth 
century, though some Catalan and Aragonese varieties still can do so, 
marginally; 

(11) Cat. (Balearics) som anat ‘I have [am] gone’. 

In ‘unreal conditional’ clauses, be (ser) auxiliary forms were also attested 
during the 1970s in the speech of elderly illiterate Spanish speakers in 
Colombia: 

(12) si mayo fuera sido invierno ‘if May had been winter’ (for . . . 
hubiera sido . . .). 

The tenuous evidence we have for Dalmatian shows that the Veglia vari¬ 
ety in the nineteenth century made no use of a be auxiliary: 

(13) Ju ai venut ‘I have come’, Jai suit ‘I have gone out’, Se jai mar- 
tuot ‘I have got married’. 

In Daco-Rumanian, on the other hand, fi ‘be’ (usually in the infinitival 
form < FIERI?) is consistently used in some of the active compound tenses, 
though avea ‘have’ is required for the most frequent past form, the perfect: 

(14) perfect indicative Am jurat (or jurdt-am) ‘I have sworn, I swore’; but perfect 
subjunctive [S3]^ jurat, future perfect Voi fi jurat ‘I shall have sworn’, past 
conditional A^fi jurat ‘I should have [be] sworn’. 

The past participial form.^cwt, is also used in the popular surcompose plu¬ 
perfect {Am fast jurat T had sworn’, literally T have been sworn’; see 3.2). 

This has supplanted an older type with the imperfect of ‘have’ {Aveam jurat) 
still found in dialects south of the Danube, alongside the (infrequent) syn¬ 
thetic standard form jurdsem. 

There is some trace of use of fi with some intransitive verbs: 

(15) Este plecat ‘He is arrived’, for A plecat ‘He has arrived’; in 
dialects south of the Danube, Sam vinit ‘I am come’, for stan¬ 
dard Am venit ‘I have come’. 

If Old Daco-Rumanian did resemble Old Spanish in its use of a be auxil¬ 
iary in the perfect tense, however, the usage had virtually disappeared by 
the time of the first Rumanian texts in the sixteenth century. 
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Slavonic (specifically Bulgarian) influence is plausibly invoked to explain the 
current usage, which can lead to ambiguity in some tenses: 

(16) voi I asfi chemat ‘I shall/should have called’ or ‘I shall / should have been 
called’. 

In the perfect subjunctive, on the other hand,,fi is conjugated in the passive 
whereas it is invariable in the active: 

(17) sd flu chemat ‘that I have been called’ I sd fi chemat ‘that I have [be] called’. 

In standard French the auxiliary etre is selected by some score of intran¬ 
sitive verbs, most of which are used with avoir in non-standard varieties. 
There is resistance to the switch to avoir with some motion verbs, like venir 
‘to come’ (II est alle ‘He has [is] gone’ commonly being supplanted by II a 
etc ‘He has been’). This is so too with crucial ‘change-of-state’ verbs, 
mourir ‘to die’ and naitre ‘to be born’. Earlier in history, the choice was 
not so clearly lexical and examples like a ale, ‘he has gone’, compared with 
modern II est alle, are found. 

Occitan on the whole follows the same sort of pattern as French, 
though the range of esse verbs is slightly wider. Rhaeto-Romance vari¬ 
eties show a similar distribution, though generalization of habere is not 
so evident in Surselvan as in other varieties, perhaps because of the 
German influence in this dialect. 

Standard Italian makes the greatest and most consistent use of essere 
as a perfect auxiliary. So, as a more clear-cut case, it receives most atten¬ 
tion from linguists. In standard Italian essere may be selected even when 
the verb is preceded by a modal, unlike in French (where similar con¬ 
structions are found in Middle French, there may have been Italian influ¬ 
ence): 

(18) Ital. k andato, E dovuto andare / French II est alle, II a du aller ‘He has gone’, 
He has had to go’. 

Italian modal verbs are called servili, with ‘servile’ adoption of the auxiliary 
associated with the lexical verb. Central Italian varieties are similar to the 
standard in this respect. In North Italy habEre is traditionally used with 
modals, but perhaps through Tuscan influence, esse is found, especially in 
Piedmont, Lombardy, Emilia-Romagna and Liguria: 

(19) Romagna A Jo vld parti ‘I have [am] wanted to go off’ / Venice Go void partir, 
Milan U vursii parti ‘I have wanted to go off’. 
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Non-standard Italian varieties adopt a host of different solutions in aux¬ 
iliary selection. In the south habere tends to be generalized, with varia¬ 
tion in some areas: 

(20) Naples Aggio vemito / So bbenuto, ‘I have/am come’, A muorto / E mmuorto 
‘He has died / is dead’; Calabria Amu venutu ‘They have come’; Sicily A 
murutu ‘He has died’. 

In the north most dialects use esse with rather fewer verbs than the stan¬ 
dard, but some use only habere: 

(21) Liguria A ndet ‘He has gone’. East Veneto El ga tornd ‘He has come back’. 

In several Central and in a few Northern and Southern dialects esse is 
consistently used for transitive as well as for intransitive verbs, though 
sometimes the auxiliary may vary according to the person of the verb 
(often the third and sixth persons select habere): 

(22) Genoa Son dormi ‘They have [are] slept’; Carpignano Sun krumpd ‘I have 
[am] bought'; Novara Mi i son mangid / Lii I'd mangid ‘I have [am] eaten’ / 
‘He has eaten’; Marche (Servigliano, Ascoli Piceno) Sd mazzato lu porcu / 
Issu ha magnato / Issu e vrinutu ‘I have [am] killed the pig’/ ‘He has eaten’ / 
‘He has [is] come’; Latium (Cori) Nu simo magnato / Issi do magnato ‘We 
have [are] eaten’ / ‘They have eaten’. 

One consequence is that the be passive tends to be avoided in these 
dialects. Similar generalization of esse is recorded for some Catalan vari¬ 
eties, apparently with verbs that do not passivize: 

(23) Capcir Sun tingut ‘I have [am] had’; Gerona So bist ‘I have [am] seen’. 


(ii) Semantic factors 

The particular aptitude for esse and habere to grammaticalize as auxil¬ 
iaries, bleaching further their rather colourless lexical content, has long 
been recognized. Some have claimed that in essence they represent gram¬ 
matically predicative and transitive complementary versions of the same 
functional entity. In a few Romance languages (some Francoproven^al 
dialects, for instance) they merge into one set of forms; elsewhere they 
may influence one another morphologically. 

One curious feature of some Eastern French and Francoprovengal dialects, 

also found in Old Italian texts and in Swiss Italian varieties, is that forms 
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cognate with *etre eu ‘be had’ replace avoir ete ‘have been’ (compare stan- 
dard Italian essere stato ‘be been’). 

Some semantic coherence among the intransitive verbs that select an esse 
auxiliary in the compound tenses is readily discernible. The distinguished 
French linguist, Benveniste, for instance, saw etre as signalling a resulting 
‘situation’, and avoir an ‘operation’. No single semantic criterion, however, 
seems to predict which verbs require an esse auxiliary, although state or 
change of state seems often to be involved. Most centrally, and historic¬ 
ally most durably, it is used with verbs of motion or lack of motion, 
especially when the type of motion is not lexically specified: 

(24) French II est alle ‘He has [is] gone’, II est reste ‘He has [is] stayed’. 

In some examples a distinction can be made between stative or telic uses 
of a motion verb and those that place more emphasis on the action itself: 

(25) Ital. Maria e corsa a casa di Gianni ‘Mary ran [is run] to John’s house’ / Maria 
ha corso piu velocemente di Gianni ‘Mary ran [has run] quicker than John’. 

(iii) Unaccusativity 

In recent discussion the relationship of the grammatical (‘surface’) subject 
to the verb has emerged as an important determinant of auxiliary choice. 
In the framework of Relational Grammar, Perlmutter (apparently following 
up a suggestion of Postal) put forward the hypothesis that the single argu¬ 
ment of some intransitive verbs is a derived subject which underlyingly is a 
direct object - functionally a theme, or experiencer, rather than an agent. 

Because such objects do not acquire accusative case, these verbs are 
labelled unaccusative, while other intransitives are (rather confusingly) 
called unergatives. 

Generativists use the term ergative for ‘unaccusatives’, following Burzio, who 
describes them in configurational terms (as having a D-structure frame [e] V 
NP, with the subject base-generated in postverbal position), and who sug¬ 
gests they assign a theta role to the direct object position and not to the sub¬ 
ject position. However the ‘ergative’ cannot assign accusative case to the 
underlying object NP which assumes the status of a subject, acquiring nomi¬ 
native case via an agreement relationship with the verb. 

Just as in passive constructions in most of the Romance languages, in 
which the surface subject is also underlyingly a direct object, the be aux- 
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iliary is associated with promotion of the underlying object to subject 
status. Indeed choice of auxiliary esse has been proposed as one of the 
criteria used for identifying unaccusative or ergative verbs. 

Romance languages, however, provide some instances of the be auxiliary 
being used only when promotion to subject has been effected, whereas the 
‘theme’ can remain in postverbal position with a have auxiliary: 

(26) Occ. (Foix) Qualqus es bengut IA bengut qualqus ‘Someone is come’ / ‘There 
has come someone’; Sard (Lula) Tres pitzinnas sun arrivatas ‘Three girls have 
[are] come’ / B’at arrivatu tres pitzinnas ‘There has arrived three girls’. 

It has been claimed that the choice between be and have auxiliaries for 
intransitive verbs is determined by parametrized principles which can also 
account naturally for subject inversion and use of the adverbial clitic, Ital. ne, 
French en, in Romance. 

(iv) Pronominals 

German bears many resemblances to Romance in the matter of auxiliary 
selection in the compound tenses. However, one important difference is 
that in Italian and French reflexive (or better, pronominal) constructions 
require a be auxiliary, whereas in German haben is used in parallel con¬ 
structions. In Romance a pronominal structure (with a reflexive clitic) is 
used not only for true reflexives and reciprocals: 

(27) French II se tue / Ils se tuent ‘He kills himself’ / ‘They kill each other’: 

but also as a middle for passives (see 4.7): 

(28) Cela se fait! La branche se casse ‘That is (usually) done’ / ‘The branch breaks’. 

There are also lexicalized (‘inherent’) pronominals: 

(29) II s’evanouit ‘He faints’. 

In some languages, too, the third person reflexive (< se) plays the role of 
non-specific human agent (the so-called ‘impersonal’ use, see 6.5). 

Under the ‘unaccusative’/’ergative’ hypothesis, as it applies to Romance, the 
identity of subject and direct object calls for reflexives to be treated in the 
same way as unaccusative intransitives - but this applies only when the 
reflexive element is a clitic pronoun: 


(30) 


Ital. Si e lavato / Se ha lavato, ha lamto se stesso ‘He got washed’ / ‘He 
washed himself. 
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Generalization of esse in pronominals is fairly recent and may indeed be 
connected chronologically with clitidzation of object pronouns. It has not 
spread to most non-standard varieties in either Italy or France. It is to be 
noted too that in Italian esse is selected by reflexives through a modal 
only when there is clitic climbing (see 6.5): 

(31) Ital. Ho voluto levarmi / Mi son voluto levare ‘I have wanted to get (myself) up’. 

The precise structural effect of clitidzation of the reflexive element is much 
discussed, and related to non-reflexive pronominal constructions like: 

(32) Ital. Si e sbagliato ‘He has made a mistake’ (inherent); Si e comprato un rega- 
lo ‘He bought a present for himself’(with dative si); It regalo si i comprato ieri 
‘The present was bought yesterday’ (‘passive’). 

On semantic grounds one would think that passive and inherent pronominals 
should be more prone to act like unaccusatives than true reflexives and recipro¬ 
cals, which ate transitive constructions. That these latter should opt for esse 
causes much puzzlement. Early commentators suggested that the usage derives 
from a Latin impersonal passive construction, like lodatus est sibi ‘it was 
praised to him’, in which the dative pronoun (sibi) was reanalysed as a reflexive. 

Certainly the evidence is that before the modem period French and Italian 
pronominal constructions (whether reflexive or ‘inherent’) selected 
HABERE as often as, or more often than, esse: 

(33) Old French Conan s’a bien defendu ‘Conan has defended himself’; S’a pasme 
‘He has fainted’; Old Ital. S’a segrolato ‘He has ruined himself’; Se I’ha 
sposata ‘He has married her’. 

This still happens in non-standard usage and in dialects nearly every¬ 
where, even where esse is used for unaccusatives. In all the following 
examples a have auxiliary is used: 

(34) Ital. dial. (Asolano, Veneto) Se on lava e man ‘We have washed our hands’; 
Me a 0 godestd ‘I have enjoyed it’; I se gea intosegd ‘They have got angry’; 
Me 6 marida ‘I (female) have got married’ / E nato ‘He has gone’. 

Quite often there is hesitation about which auxiliary to use, though some¬ 
times this may be due to influence from the standard, which sometimes dic¬ 
tates the use of esse when popular usage prefers habere : 

(35) R-R (Friuli) Mi soi vistide / Mi ai vistU ‘I got dressed’ / ‘I have dressed 
myself’; (Surselvan) Jeu sun sescaldaus I Jeu hai sescaldau ‘I have been scald¬ 
ed’ / ‘I have scalded myself’. 
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It is possible here, as my translations hint, that true reflexives prefer habere, 
whereas inherent pronominals select esse, and that a semantic difference is 
signalled by the choice of auxiliary. However, hard evidence is not usually 
available for the modem non-standard varieties. A turn-of-the-century 
description of the Lower Engadine Rhaeto-Romance dialect does suggest, 
though, that true reflexives may be treated differently from inherent pronom¬ 
inals, which act like unaccusatives: 

(36) No eschan schtats, tuormts, ‘We have [are] been, returned’ (unaccusative) / E 
m a miida ‘I have changed (myself)’; Tii I asch lava ‘You have washed (your¬ 
self)’ (reflexive) / Tii t esch schbaglia ‘You have made a mistake (are mistak¬ 
en)’; El s esch arabid ‘He has [is] got (himself) annoyed’ (inherent). 

Surselvan also provides some examples that point in the same direction (but 
here the invariable reflexive marker is agglutinated to the past participle, and 
it can be claimed that the forms are lexicalized as unaccusative or transitive 
respectively): 

(37) II giuven ei sepladius ‘The young man was hired out (has [is] contracted him¬ 
self as a labourer)’; La plaga ei s’aviarta ‘The wound has [is] opened [itself]’; 
La madregna veva sevistgiu de femna veglia ‘The godmother had dressed her¬ 
self up as an old woman’; Quella ha sedostaii ‘That one [female] has resisted 
(defended herself)’. 

Evidence from other Rhaeto-Romance varieties points to the opposite con¬ 
clusion, however: 

(38) Friuli Lor e son Ids ‘They have [are] gone’ (unaccusative) / Si sin sentdz ‘We 
have [are] sat ourselves down’ (reflexive) / Jd si a mitut a vat ‘She has begun 
to cry’ (inherent). 

Similarly, though Friulian optionally uses jessi ‘be’ with true reflexives, in the 
‘impersonal’ construction ve ‘have’ is normal: 

(39) Si a balat tude la gnot ‘There has been dancing [it has danced itself] all night’. 

Sometimes auxiliary choice with pronominals is sensitive to the person of 
the verb, as we saw was the case with unaccusatives in some Italian vari¬ 
eties. In the Francoproven 9 al dialect of Vermes in the Jura, for instance, 
ESSE is used only in first person singular forms: 

(40) L car s'e rate ‘The bus has stopped (itself)’; I mse di ‘I have [am] said to 
myself’. 

It has been suggested that agreement of the past participle with the subject in 
pronominals using habEre in some dialects is a relic of original use of esse: 
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(41) Ital. dial. (San Tommaso, Abruzzo) Ci auamme erreuete ‘We have arrived’ / 
S'a rrauate ‘He has arrived’; R-R (Engadine) Mieus ogls s'han avierts ‘My 
eyes have opened’, (Lower Engadine) Ela s’a la\ada it ceu ‘She has washed 
her hair’. 

However, it is possible that here we have examples of agreement with the 
direct object, as in transitive verbs (see 6.5). 

The traditional view is that the use of esse in true reflexives results from 
a contamination with unaccusative use, through the middle or passive 
pronominals which share some syntactic and semantic characteristics 
with unaccusatives. The spread may have been more marked in the stan¬ 
dard languages as a reaction to the more popular tendency to abandon 
distinction between habere and esse perfects in general. 

There is clearly some connection between the esse choice and the use of 
pronominals with passive value (see 4.7). One as yet unexplored possibility 
is that there is also a connection with the spread in some Romance languages 
of the compound perfect into the territory of the preterite inherited from the 
Latin perfect (see 3.3, 7.3), and with the development of two distinct be aux¬ 
iliaries in some languages that have not retained auxiliary selection (see 7.5). 

(v) Weather verbs 

One question that has been raised with respect to the unaccusative 
hypothesis concerns the status of weather verbs in Romance. In Italian, 
but not in French, such verbs can select esse in the compound tenses: 

(42) Ital. Epiovuto, Ha piovuto / French It a plu ‘It has rained’. 

In some contexts the essere option seems to be the preferred one in stan¬ 
dard Italian: 

(43) £ piovuta una pioggerella fina ‘It has rained a fine drizzle’; Mi e piovuto addosso 
‘It has rained on me’. 

But where the action is emphasized avere is used: 

(44) Ha piovuto tutta la notte ‘It has rained all night’. 

Elsewhere, too, although habere is more frequent, ESSE is possible, appar¬ 
ently emphasizing the result of the phenomenon: 

(45) R-R (Surselvan) Id a ploii tuotta di a tschel ruot ‘It’s [has] poured in bucket¬ 
fuls all day’ / Vara, vara id e ploii ‘Look, look, it’s [is] rained’. 
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In Italy regional dialects vary in their usage. In the north both auxiliaries are 
possible, but in most places habere is more frequent: 

(46) Milan L’e fiukd, Bormio L’e flokd I L’a flokd ‘It has [is] snowed’ / Piedmont 
A I’a fiukd, Venice El ga nevegd. 

Whereas Central dialects use esse, in the south, Sicily and Sardinia we find 
only HABERE; 

(47) Sard A probiu, Abruzzo Ha piovute ‘it has rained’. 

Usually the dialectal treatment of weather verbs is similar to that of other 
impersonals: 

(48) Tuscany E bastato / Venice Ga bastd ‘It has been enough’; Milan M'e piasu I 
Venice El me ga piasu ‘It has pleased me’. 

Interestingly, however, Tuscan uses habere in constructions like: 

(49) Ha fatto piogga ‘There’s been rain’. 

One possible explanation is that weather verbs have as their only argument 
an implicit cognate object (that can become a derived subject, somewhat 
as in ‘The rain, it raineth every day’), and that they should be counted as 
unaccusative. Certainly in French there is a parallel with indisputably 
unaccusative verbs in the impersonal construction: 

(50) II est arrive plusieurs personnes ‘There arrived several people’ t II a pin toute 
la journee une petite pluie fine ‘There rained all day a very fine drizzle’. 

If weather verbs are unaccusative in French, then obviously choice of the 
etre auxiliary is not coterminous with unaccusativity. One problem in dis¬ 
cussing the whole question of auxiliary selection is that in French, as in 
most of the modern Romance languages, the esse option is obsolescent 
and frozen into its place by normative grammar. Older texts, and usage in 
some conservative varieties, hint at more flexibility, perhaps dictated by 
pragmatic requirements. 


(c) Nasalization 

In discussion of the phonological process of vowel nasalization and of the 
status of nasal vowels, evidence from Romance has figured prominently. 
Most influential has been the benchmark provided by the well-document- 
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ed development of distinctive French nasal vowels - in the modem stan¬ 
dard [a], [e], [5] or [6], and more marginally [a^, contrast with their oral 
counterparts (the nasal vowel being diacritically marked in the graphy by 
a silent nasal consonant): 

(51) angeldge ‘angeVrage’ifeintIfait ‘pretends7‘does’; bonibeau ‘goodV'hand- 
some’; brunipreux ‘brownV'valiant’. 

The only other standard language with a salient series of nasal vowels (and diph¬ 
thongs) is Pbrtuguese; vdiere their history is more confused, and their phonological 
status more ambiguous, even though the orthography sometimes picks them out 
even more distinctively by the use of a tilde or til: 

(52) mao Imau ‘handV'bad’; la Ha ‘woolV'there’; somisd ‘soundV'alone’; vendesi 
vedes: ‘you seH7‘you see’; vimivi ‘I came’/‘I saw’; paolpano ‘bread’/‘cloth’. 

Among non-standard Romance varieties phonological vowel nasality is 
rather infrequent, but it is found in seattered dialects all over the 
Romance-speaking area: 

(53) Sard (Sarrabus, South-East Campidanian) [ma?u]/[ma?u] ‘hand’/‘bad’; Span, 
ns. (Andalusia) [kanta]/[kanta] ‘they sing’/‘he sings’; North Ital. dial. (Milan) 
[sa:]/[san]‘healthy’ (m. sg.)/(f. pi.), (Imola) [ke!]/[E:n] ‘dog’/‘year’; 
Francoproven?al [la;na]/[a:no] ‘woor/‘donkey’. 

French and Portuguese creoles make use of the nasal-oral vowel distinc¬ 
tion perhaps rather more frequently than their metropolitan equivalents: 

(54) Haiti [faimJ/lJaim ] ‘room’/‘charm’, panipan ‘hang’/‘rag’; Sao Tome [fudu] / 
[fudu] ‘deep’/‘clean’. 

In Romance phonetic nasality when the vowel is in contact with a nasal 
consonant, which may lose its segmental status, is fairly widespread and 
often acoustically salient, though sometimes frowned on by purists: 

(55) Rum. dens [des] ‘dense’, munti [mtsj] ‘mountains’, am [a\fost ‘I have been’; 
Span. (Mexico) corazon [koraso] ‘heart’; French ns. (Acadia, Canada) meme 
[mem] ‘same’. 


(i) Phonologization 

Phonologization of vowel nasality in French has been much discussed. 
The evidence of assonance and rhyme in medieval texts points to assimi- 
latory nasalization under the influence of a following nasal consonant. 
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and sixteenth-century descriptions of pronunciation leave us in no doubt 
that nasal vowels and diphthongs were phonetically salient at that time. 
However, it is usually assumed that phonologization of the distinction did 
not occur until preconsonantal and prepause nasal consonants (like other 
consonants) dropped out during the sixteenth century, leaving behind 
only the nasal quality of the vowel. 

At about the same time, there began a process of denasalization of 
vowels followed by intervocalic nasal consonants. The truncation of aton¬ 
ic final vowels often left these in final position, so that there arose a con¬ 
trast between nasal vowels (V) and oral vowel + nasal consonant sequences 
(VN), often in morphologically related forms: 

(56) bon ‘good’ (m.) [bo] / bonne (f.) [bon], earlier [bona]; vient ‘(he) comes’ [vje] / 
viennent ‘(they) come’ [vjen], earlier [vjena]. 

Nasal vowels and diphthongs, now cut loose from their close link with the 
VN sequences, collapsed into a smaller inventory, tending to lower and/or 
simplify: 

(57) fin ‘fine’ (m.) [fe], earlier [fin] /fine (f.) [fin], earlier [fTna]; brun ‘brown’ (m.) 
[bro^, earlier [bryn] / brune (f.) [bryn], earlier [bryna]; vain [ve], ‘vain’ (m.), 
earlier [vejn] / vaine (f.) [ven], earlier [vejna]. 

Etymologizing orthography continued to use nasal consonant symbols to 
signal vowel nasalization; moreover in liaison contexts (see 7.2) the nasal 
vowel usually was resolved once more into a VN sequence: 

(58) bon enfant ‘good child’ (m.) [bonofd]; divin enfant ‘holy child’ (m.) [divinafa]; 
vain effort ‘vain effort’ (m.) [venefoR]; un effort ‘an effort’ [ynefoR]. 

In more recent times (particularly in the last hundred years) such resolu¬ 
tion is not mandatory, and nasal vowel + nasal consonant (to) sequences 
have become normal in some liaison contexts: 

(59) mon enfant ‘my child’ (m. or f) [monafa]; en ete ‘in summer’ [anete]; un effort 
[anefoR] ‘an effort’. 

In spite of these anomalies, generative phonologists enamoured of 
‘abstract’ analysis found in French nasal vowels an opportunity to elaborate 
a rule system by which the surface phonemic contrast was subordinated 
to a more ‘underlying’ uniformity. In the so-called ‘classic’ generative 
account, nasal vowels are synchronically derived from VN sequences. 
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as they had been diachronically. In the synchronic account, however, the 
French nasalization rule could apply only in the presence of tautosyllabic 
nasal consonants, and must be followed by a rule truncating preconso- 
nantal and prepause nasal consonants: 

f#l 


(60) 


V -> [+nasal] /_N 


N ->0 


/V 


The diachronic nasalization of preceding vowels by intervocalic conso¬ 
nants, and consequent denasalization, would not be reflected in the syn¬ 
chronic description. 

One snag in this type of account is that the orthographic vowels of atonic 
word-final syllables have to be postulated as present (even though they are not 
pronounced in the modem standard language) in the underlying forms, as they 
are required to account for the non-nasal character of the vowels in forms like: 

(61) bonne [bon], underlyingly /bone/ ‘good’ (f.); viennent [vjen], underlyingly 

/venet/ ‘(they) come’. 


Once the final vowel (usually a schwa which serves as a morphological mark¬ 
er) has done its job it can be eliminated in a low-level phonetic rule. 

Although the nasalization rules proposed were fairly unexceptionable, it 
was the status of schwa (mute e, e muet or e instable) in modern French 
that aroused controversy and provoked more ‘concrete’ generative 
accounts, which often distinguish non-alternating fully lexicalized nasal 
vowels from the morphologically determined alternations between V and 
VN. 

More recently multilinear phonology has neatly proposed descriptions of 
French nasalization by which a nasal consonant in the coda can ‘float’ extra- 
metrically, losing its place and manner features, but transferring its nasality 
to the adjacent nuclear vowel; though the formalization is more satisfactory, 
the analysis has not changed much. The suggestion that so-called ‘nasal 
vowels’ are diphthongal in character receives some support from experimen¬ 
tal data, especially for Portuguese and Canadian French. 

The French example has prompted the widely held view that phonologi¬ 
cal nasalization normally develops from phonetic regressive assimilation 
as a result of the loss of the nasal consonant which provided the trigger¬ 
ing context. In Portuguese, however, vowel nasalization can also be trig¬ 
gered by a preceding nasal consonant: 
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(62) mint for mi, mae < matrem ‘mother’; muito [muintu] < multu ‘much’. 

Discussion often revolves round the question of whether the triggering 
nasal consonant can still be articulated, allowing vowel nasality to be 
derived by low-level contextual phonetic rule. In Portuguese, for instanee, 
where nasal vowels are frequently followed by some nasal occlusion in 
preconsonantal position, they are usually assumed to be synchronically 
derived from VN sequences, though some commentators prefer to regard 
them as lexicalized, and any nasal occlusion as epenthetic. Galician and 
Northern dialects of European Portuguese, which have no nasal vowels, 
can synchronically be viewed as closer to the synchronic underlying forms, 
even though it is likely that in these varieties nasalization was followed 
diachronically by denasalization, sometimes with nasal hardening: 

(63) [la]/[lar)] ‘wool’ (Ptg. la), cais, cans ‘dogs’ (Ptg. caes). 

In Standard Portuguese there is evidence for diachronic denasalization in 
certain contexts, with or without epenthesis of a nasal consonant: 

(64) boa < boa < bona ‘good’ (f); ter < leer < tenere ‘hold, have’; minha < mia 
< MEA ‘my’ (f.); cheio < cheo < plenu ‘full’. 

In Rumanian, the earliest (sixteenth-century) texts, written in Cyrillic script, 
frequently, but not consistently, use the nasalizing symbol t with or without a 
graphical nasal consonant (or its rhotacizing substitute r), presumably to indi¬ 
cate nasalization of a vowel by a following nasal consonant: 

(65) ufbralubra ‘shadow’; priTn ‘through’; mdtre ‘hand’; lura ‘moon’. 

In some cases an etymological nasal consonant has been definitively lost 
in all dialects; lexicalized nasalization and subsequent denasalization of 
the adjacent vowel is usually assumed for an early period: 

(66) cat ‘how much?’ < quantum; cdire ‘towards, against’ < contra. 

The modem Daco-Rumanian nasal vowels are often ignored in synchron¬ 
ic descriptions, as they can readily be seen as surface realizations of VN 
sequences, with or without loss of nasal occlusion. 

(ii) Vowel height 

The French case has also suggested that there is some sort of hierarchy of 
nasalization depending on vowel height, by which low vowels are more 
readily nasalized than high vowels. This widely held hypothesis is based 
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not only on the well-attested lowering of nasal vowels in early modern 
period standard French, but also on the perception that Old French verse 
assonance patterns show separation of an and en sequences from their 
oral counterparts much earlier than is the case for in and un sequences. 
Statistical study of a wider range of texts has shown, however, that this 
perception is illusory and that apparent patterns of assonance are plausi¬ 
bly due to other factors, including the ready availability of certain end¬ 
ings, like present participle -ant, to satisfy verse conventions. Uncertainty 
about the constraints of assonancing conventions, and conflicting evi¬ 
dence from later rhyming verse, add to scepticism about the vowel height 
hierarchy hypothesis. 

As it is likely that at the relevant time French vowel nasality was not yet 
phonologized, one of the factors to be considered is the possibly relative¬ 
ly inconspicuous acoustic salience of nasality with high vowels. In 
Portuguese high vowels are particularly prone to take on nasal resonance, 
even where no nasal consonant is in the environment: 

(67) sim ‘yes’, for si (traditionally thought to be influenced by nao ‘no’ < non, 

but si ‘himself’ is also frequently nasalized in non-standard varieties, here 
said to be influenced by mini ‘me’). 


(iii) Causes of nasalization 

The classic works of Romance philology have sought the cause of French 
and Portuguese nasalizing tendencies in the Celtic substratum, whereas 
Rumanian was deemed to be influenced by Slavonic. One feature that 
these languages share and which has been linked to nasalization is their 
use of schwa-like vowels, that are readily elided, in atonic syllables. The 
strong accentuation that this feature implies might well have the effect of 
reducing the segmental status of nasal consonants. However, as we have 
seen, contextual phonetic vowel nasality is widespread in Romance (as 
indeed is reduction of atonic vowels). It can be maintained that only in 
French (and perhaps not even there) have nasal vowels become lexicalized, 
consequent on other radical changes in the phonological setup - in par¬ 
ticular truncation processes in the early modem period. Phonologization 
(and morphologization) of vowel nasality can be viewed as due to 
reanalysis after denasalization processes had established a contrast 
between V(N) and VN sequences. 
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CONCLUSION 

I have tried to give examples of how study of the Romance languages has 
contributed to knowledge about languages in general. Throughout the 
rest of this book I shall pinpoint discussions within Romance that are of 
interest to linguists in general, rather than attempt the sort of general 
account of Romance that is to be found in the classic manuals. This must 
mean that I neglect some aspects of Romance linguistics and that my 
choice of topic is somewhat subjective: I try, however, to retain some bal¬ 
ance between treatment of phonology, morphology, syntax and lexicon, 
and to take into account social and historical factors. In being selective 
about the topics covered, I hope to look in rather more detail at some of 
them than would be possible in a general survey. 


FURTHER READING 

Posner (1966) is an elementary introduction to Romance linguistics. An 
account of the discipline,with full bibliographies, is to be found in Iordan, 
Orr & Posner (1970). More recent work is covered in Posner & Green eds. 
(1980-93). A useful select bibliography is Bal et at. (1991). A valuable collec¬ 
tive survey of individual Romance languages, together with chapters on 
Latin and Romance, is Harris & Vincent eds. (1988). Encyclopedias, such as 
the Encyclopedia Britannica, Bright ed. (1991) and Asher & Simpson eds. 
(1994), have articles on Romance in general and on each principal language. 
Devoted solely to Romance is Sala ed. (1989). Still in process of publication 
is Holtus, Metzeltin & Schmitt eds. (1988-), which will give synchronic and 
historical accounts of each Romance area and overall surveys of the 
Romance family and its historiography. Each year the Year's Work in 
Modern Languages (published by the Modern Humanities Research 
Association) carries surveys of recent publications on the linguistics of 
Romance and of each individual Romance language. Current work is report¬ 
ed in the proceedings of the triennial international conference of the Societe 
de Linguistique Romane and in the annual publications based on the 
American Symposium in Romance Linguistics, and the German 
Romanistentagen. Periodicals concerned with the Romance languages as a 
whole include the Zeitschrift fur romanische Philotogie, the Revue de linguis¬ 
tique romane, the Revue des langues romanes, Romance Philology, and (since 
1989) Probus. For works mentioned explicitly in this chapter see the general 
bibliography, and for works on individual languages see chapter 5. See also, 
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on the history of Romance linguistics, Bahner (1983), Niederehe & 
Schlieben-Lange (1987), Bossong (1990), and on unaccusativity Burzio 
(1986). On nasals, Tranel (1981). 




PART I 

The similarities 




1 


What is a Romance language? 
Part 1 


1.1 WHAT IS A ROMANCE LANGUAGE? 

If we knew nothing about the history and the geographical location of a 
Romance language, would we be able to recognize it as Romance from its 
linguistic features alone? Certainly, it seems that nothing in the phonetic or 
phonological makeup of those languages we are agreed on calling 
‘Romance’ is particularly distinctive. The most obvious similarity is their 
shared lexicon: I shall leave that for discussion in the next chapter. 

One difficulty that needs to be faced straightaway, though, is that some 
languages are not normally recognized as ‘Romance’ because of their 
morphosyntactic features. In particular, American, African and Asian 
creoles have quite decidedly ‘Romance’ vocabulary. Other languages - like 
Albanian, Basque, Maltese and even Berber - are sometimes regarded as 
even more borderline cases, because a substantial part of their vocabulary 
is identifiably ‘Romance’, but the basic vocabulary is rarely affected, and 
so we treat them as having extensively borrowed from Romance but not 
otherwise closely related. 

(a) Morphosyntactic features 

If we consider creoles - lexically related to French, Portuguese or Spanish - 
as not properly Romance, it is because their morphology and syntax are 
often markedly and consistently different from those of their metropolitan 
counterparts. Though they may differ phonologically from their respective 
lexifier languages, it is their shared morphosyntactic features that lead com¬ 
mentators to assign them to a separate category of languages, that includes 
other creoles, rather than to Romance. In chapter 21 shall be paying particu¬ 
lar attention to these in an attempt to define the bounds of Romanceness. 
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The standard Romance languages share many features that are lacking in 
creoles: for instance, noun gender, agreement, the definite article and, most 
important, morphological verb inflections. Usually the morphological fea¬ 
tures that are most clearly Latin remnants - surviving more intact in ver¬ 
bals than in nominals - are viewed as peculiarly Romance. 

Typologically, however, there is comparatively little to distinguish the 
standard Romance languages from other modern Indo-European lan¬ 
guages. If we compare them with English, for instance, we can note that 
they differ in the use of noun gender, in their richer verbal inflectional sys¬ 
tem, in the apparent absence of a distinct modal verb morphosyntactic 
category, in their distinctive system of clitic object pronouns and in their 
pronominal verbs. But they are similar in retaining morphologically 
marked number in nominals, and in having developed periphrastic perfect 
and passive forms. 

It is usual to claim that Romance word order, in the core languages as 
well as the creoles, is dominantly SVO (subject-verb-object), unlike the 
SOV order of Latin. However, in many varieties ‘atonic’ object pronouns 
precede finite verb forms. Nearly everywhere surface word order is prag¬ 
matically determined, with a tendency to verb-first ordering. In some 
Swiss Rhaeto-Romance varieties (and in Old French) a Germanic-like 
verb-second order is dominant. In most varieties auxiliaries and modals 
precede the lexical verb in compound and periphrastic constructions, as in 
English: Sard and Rumanian are partial exceptions. Standard language 
noun-adjective order does differ from English: although, in unmarked 
sequences, adjectives usually follow the noun, there are many exceptions - 
especially in the literary languages, where stylistic effects can readily be 
achieved by variation of order (see 3.4). 

The comparison with English can, of course, only be by way of illustration: 
several of the features cited as distinguishing Romance from English are found 
in other Indo-European languages. 


(b) Romance ‘archetype’ 

Typological identification of a language as ‘Romance’ is fraught with diffi¬ 
culties, for much will depend on what language is regarded as ‘prototypi- 
cally’ Romance. Some Romance linguists - Rumanian-born Eugenio 
Coseriu in particular - seek to establish, within a ‘holistic’ or ‘integral’ 
typology, what the essential Romance characteristics are. Coseriu’s con- 
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tention is that the Romance guiding principle is that within the nominal 
system syntagmatic (or external) distinguishing features prevail, whereas 
in the verbal system, paradigmatic (internal or morphological) distinctions 
are more important. He goes further and attempts to motivate the differ¬ 
ence functionally, by linking ‘external functions’ - those concerned with 
actualization within the sentence, like case relationships - with ‘external’ 
or syntagmatic expression features, while ‘internal functions’ - sometimes 
more directly related to reality, like noun number or verb tense and some¬ 
times more specifically linguistic, like noun gender - are expressed para- 
digmatieally. 

Coseriu’s ‘Romanee type’ must, however, exclude modem French, which 
has changed since the Middle Ages into a more analytic language, relying, 
for instance, on subject pronouns to mark verb person, and losing the pro¬ 
ductive nominal diminutive formations that persist in other Romance lan¬ 
guages. Coseriu places Rumanian too outside the Romance inner circle, at 
the other extreme from French. I prefer to identify Romanceness in terms 
of an archetype, not in the sense of the most ‘neutral’, but in that of the 
most ‘central’ of the languages - the model to which all the others may be 
most easily eompared. 

Occitan and Catalan are often seen, on morphological grounds, as the 
least distinctive Romance languages, the ‘crossroads’ of the Romance 
world. Catalan is spoken of as bridging the gulf between the closely knit 
Spanish and Portuguese grouping and the more central languages. 
However, this may be no more than a reflection of the conservative com¬ 
promises arrived at in the standardization of Catalan and Occitan. 
Without official status for so long, their different regional varieties 
diverged quite widely. Standardizers attempted to take into account actual 
usage, while looking back to earlier literary conventions. In the case of 
Catalan, influence from Castilian in the modem period has been great, 
though the standard, as taught, attempts to minimize this effect. 

On lexical and syntactic grounds, I regard standard Italian as the prime 
candidate for the status of Romance archetype, as a language which has 
most in common with each of the others. In some ways it is so close to 
Latin that it has even, as we have seen, been mistaken for a colloquial vari¬ 
ety of the Latin literary language. It has been carefully nurtured through¬ 
out its history as an ‘illustrious’ language modelled on Classical Latin, so 
much so that some doubt that it can really be called a ‘natural language’. 
The term standard Italian has even been described as a ‘somewhat ficti- 
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tious label. . . aiming to circumscribe a hopelessly vague concept’, for the 
grammar book version is comparatively seldom heard, unsullied, in uned¬ 
ucated speech in Italy. 

My choice of Italian (the modern geographical surrogate for a historical 
Popular Latin) as the archetype can be challenged as tending to confuse 
synchronic and diachronic criteria: my own view is that a comparison of 
Romance languages must take account of both criteria. 

(c) Degrees of Romanceness 

Round the ‘inner core’ represented by standard Italian, the other languages 
cluster with greater or less cohesion. For instance, the Spanish-Argentinian 
scholar Amado Alonso, following Schuchardt, postulated a Romania con- 
tinua, stretching from Portugal through Spain and southern France and 
into the Italian peninsula, leaving outside its bounds more maverick lan¬ 
guages like French, Rumanian and Sardinian. This, and other attempts at 
sub-grouping, will be discussed in chapter 5: frequently, as we shall see, the 
intensity of ‘Romanceness’ is linked with postulated continuity of a Latin 
comparatively free from violent impact from barbarian tongues. 

A rather different view is taken by the Brazilian scholar Theodoro 
Henrique Maurer, who stresses that many of the similarities of Western 
Romance standards are due to convergence stemming from conscious imi¬ 
tation of Latin models. Mutual influence has also led to convergence. 
Some sort of Sapirian ‘drift’ may also account for common features (see 
chapter 4). 

(d) Romance ‘club’ and Romance ‘family’ 

A distinction has been drawn, within Romance, between ‘organic’ lan¬ 
guages, untouched by conscious manipulation and remodelling, and ‘inor¬ 
ganic’ languages - interlects, or koinai, whether standard or non-standard. 
The definition of a ‘language’ as a ‘dialect with an army and a navy’ can 
presumably be applied only to ‘inorganic languages’. Pure ‘organic’ lan¬ 
guages, however, probably do not exist in complex societies, where we find 
what Coseriu calls ‘historical languages’ with characteristic intricate ‘archi¬ 
tecture’, in which different linguistic systems and social norms interact. 

Although local dialects and the like can still be counted as ‘historical lan¬ 
guages’, we may find it useful to distinguish the exclusive Romance ‘club’ of 
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national and literary languages, from the proliferating family, with its wide- 
spreading ramifications, ranging from the nuclear unit, still at home in 
southern Europe, to members of the extended family, fruits of emigration 
to outposts of former empires. One problem is to determine how far from 
the hub, or core, of Romanceness a language can stray without ceasing to 
be included in the extended family. This may be a sociocultural question 
rather than a linguistic one; we shall return to it in chapter 5. 

These remarks of course apply to any large family of languages; but 
here we are trying to explore further any specifically linguistic traits that 
may be held to identify Romance languages. On the morphosyntactic level, 
I shall here examine verb inflections and noun gender, as these features are 
absent from creoles, which I believe fall decidedly outside the ‘Romance 
type’. These Romance features reflect to a great degree a common Latin 
heritage: others that deviate from Latin will be discussed in chapter 6. 

Although I view creoles as typologically un-Romance, I shall seek to 
demonstrate nevertheless that many of them remain within the Romance 
family, recognized by lexico-phonetic resemblances. Here as elsewhere I shall 
not confine my observations to the Romance club of literary languages, 
which owe much of their separate identity to deliberate differentiation in the 
course of standardization, but which have also retained, or restored, much 
of the Latin tradition and have also interacted culturally and linguistically 
with each other and with Latin at various stages of their history. 

1.2 PERSON MARKERS 

Verb morphology is, I repeat, one of the most diagnostic features for rec¬ 
ognizing a Romance language. Let us take as an example morphological 
person markers in the verb system, which have similarities in all undisput¬ 
ed Romance languages. Nevertheless they present problems, synchronical- 
ly and diachronically, that have not ceased to be discussed at length since 
the beginnings of Romance linguistics, especially in so far as they relate to 
use or non-use of subject pronouns. 

I shall exemplify from the present indicative of regular verbs, at the risk 
of some considerable over-simplification. I cannot even then present sys¬ 
tematically all the ‘philological’ detail, but I shall try to fill out the bare 
bones of the exposition by expanding some points, to give some impres¬ 
sion of the complexities. 
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Table 1.1 Regular present indicative person endings in Latin, Spanish, Portuguese 
and Italian 



Latin 

Spanish 

Portuguese 

Italian 

1 

-o 

-0 

-o[u] 

~o 

2 

-s 

~s 


-i 

3 

-T 

0 

0 

0 

4 

-MUS 

-mos 

-mos [muj] 

-iamo 

5 

-TIS 

-is 

-is DJI 

-te 

6 

-NT 

-n 

-m [-] 

-no 


(a) Spanish, Portuguese and Italian. 

In the ‘inner core’ Romance standards, as in Latin, maximally six persons 
are morphologically marked by segmental bound inflectional endings, and 
subject pronouns are used only for some degree of emphasis. For ease of 
reference, I use the labels ‘persons 1-6’ (p.1-6), but it is sometimes more 
revealing to think of persons 4-6 as the plural of persons 1-3. 

For the present indicative, which tends to have the richest system of per¬ 
son inflections, these mainly reflect regular derivation from Latin. The reg¬ 
ular endings are as shown in table 1.1. Except in person 1, the Latin ending 
is preceded by the theme vowel: A, E or i. The accent falls on the root 
except in persons 4 and 5 ( e.g. -amus, -atis), when a long theme vowel is 
accented (in Latin, accent regularly falls on the penultimate syllable unless 
its vowel is short and is in a free syllable: see 3.3). The three Romance lan¬ 
guages cited continue, on the whole, the Latin forms, though they use a 
zero marker for person 3, having lost Latin -t, graphical reflexes of which 
are found in very early vernacular texts. 

The picture is, however, much more complicated than this schematic 
account suggests. Some remarks are in order to give a more accurate pic¬ 
ture of what is going on. 

(i) Spanish and Portuguese person 5 

This was marked by -des < -tis in the older language, as in some modern 
dialects. This phonetically regular ending gave way to the modem one in 
the standard language. It persisted until the seventeenth century, when 
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stress fell on the antepenultimate syllable (as in the imperfect cantdvades), 
but in the present indicative its decline began in the fourteenth century. In 
Portuguese -des is still used in the so-called personal infinitive and the 
future subjunctive (e.g.falardes). We shall see later that person 5 is replaced 
by other forms in some Latin-American varieties. 

(ii) Modern Italian person 4 

The standard -iamo ending is used for all conjugations (compared with 
maximally differentiated -amo, -emo, -imo found still today in some 
dialects). The explanation usually given is that -iamo has spread analogi¬ 
cally from a subjunctive, either of the i-conjugation, or via the ‘be’ form 
siamo (for siMUs). Most dialects generalize -amo or -emo. 

(iii) Italian person 6 

The -no ending in Italian is found in Tuscan texts from earliest times and 
is often said to be an analogical extension from sono (sunt > son) ‘they 
are’, which in turn is crossed with sono (sum > so) ‘I am’. This latter 
allegedly acquired an -o, analysed as the characteristic ending of person 
1. The frequent alternation of ‘full’ versus apocopated forms in person 1 
lends support to this hypothesis. Other possible sources for the -o of person 
6 may be analogy with the personal pronoun loro (< illorum), though the 
use of this form as a subject pronoun probably postdates extension of -o 
to person 6, and there is no clear geographical correlation between the 
two developments. Analogy with the simple past -ero ending (e.g. diedero 
< dederunt) is also possible. It is not irrelevant, perhaps, to note that 
Central and Southern Italian show a preference for vocalic word-finals. 
Southern dialects also use a vocalic final in person 6 (Sicilian -anu, -inu). 
This is not so in many North Italian dialects, where the person 6 ending is 
identical with person 3, though often obligatory clitic pronouns distin¬ 
guish them. In old Northern texts, however, -n marks person 6, as today 
in Veneto: 

(1) Cdn/en‘They sing’ 

In standard Italian e- and i- conjugations o replaces the theme vowel, perhaps 
also by analogy with sono: 


( 2 ) 


Temono ‘They fear’ (temere), Sentono ‘They hear’ (sentire). 
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In old Sienese -ono is also used in all conjugations, but in modern Florentine 
-ano sometimes occurs in the e- and i- conjugations. Other Tuscan dialects gen¬ 
eralize -eno or -mo), tending to distinguish -ano (indicative) from -ino (subjunc¬ 
tive). 


(iv) Portuguese person 6 

This is marked by nasalization of the thematic vowel: a regular develop¬ 
ment consequent on the loss of the nasal consonant. In Lisbon usage a 
nasal diphthong - [nw] am, [ej] em - is pronounced; in Brazil, however, 
nasal quality is often lost, so that person 6 is identical with person 3. 


(b) Catalan, Occitan and Sard 

The Catalan and Occitan literary languages also morphologically distin¬ 
guish six persons in the present indicative, as do Sardinian varieties. The 
endings in table 1.2 look very similar to those of Italian, Spanish and 
Portuguese, though how far they represent the reality of unfettered collo¬ 
quial usage has been questioned. 

Table 1.2 Regular present indicative person endings in Catalan, Occitan and Sard 



Catalan 

Occitan 

Sard 

Logudorian/Campidanian 

1. 

-0 

-ii-e 

-ol-u 

2. 

-s 

-s 

-s 

3. 

0 

0 

-t 

4. 


-m [n] 

-musl-us 

5. 

-u 

-tz 

-desl-is 

6. 

-n 

~n 

-nl-nt 


Consonant-final Sard forms very frequently - some say regularly - have 
an added paragogic vowel that is identical to a theme vowel: 

(3) Campidanian Cantauzu ‘We sing’, Cantaizi ‘You sing’, Cantanta ‘They sing’. 


In some Catalan and Occitan varieties persons 1 and 3 are not clearly distin¬ 
guished. The standard Catalan -o for person 1 is a comparatively recent 
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acquisition found in Central and Western dialects. Elsewhere person 1 is 
sometimes identical with person 3. In Occitan, too, the persons 1 ending -U-e 
(or, sometimes in Proveneal, -o) is not consistently used, except when the 
form would otherwise end in a consonant cluster (as in cohre, cobri ‘I cover’). 
In the old literary language persons 1 and 3 were identical except in the a- 
conjugation. 

Many modern Occitan varieties - since the seventeenth century perhaps 
through French influence - have tended to generalize use of the subject 
pronoun especially with person 3 forms: 

(4) Lome elplora ‘The man weeps’, Elplou ‘It is raining’. 

The bare verb form, however, can be used, apparently with affective 
import, especially when the lexical subject is in focus: 

(5) plou de peiras ‘it’s raining stones’, canto I'ausel (French (a cliante, I’oiseau) ‘it’s 
singing, the bird is’. 

(c) Rumanian 

The standard Rumanian language does not normally distinguish more 
than five personal endings in the present indicative, but does not obligato¬ 
rily make use of subject pronouns. In the a-conjugation, person 3 is identi¬ 
cal with person 6: 

(6) Ctnla ‘He sings, they sing’ / Cint ‘I sing’: 

whereas in the other conjugations person 1 is the same as person 6: 

(7) Tin ‘I, they hold’. Dorm ‘I, they sleep’. 

In some Istro-Rumanian varieties numerous verbs do distinguish six persons in 
the present indicative. The other standard Rumanian person endings are as fol¬ 
lows: 

Regular present indicative person endings in Rumanian 
2 -i, 4 -m, 5 -(i [ts‘] 

The Rumanian p. 2 -i ending (a graphy used for [j] after vowels or marking 
palatalization of a preceding consonant) is reminiscent of Italian: for both 
these languages some see the palatal vowel or glide as a phonetic development 
of final Latin -s, or even of -as (see 6.2). 
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(d) Rhaeto-Romance 

The Rhaeto-Romance varieties sometimes distinguish six personal end¬ 
ings; in the Engadine, for instance, persons 1 and 3 differ in that the latter 
retains the thematic vowel: 

(8) /^‘l lift’//eva "he lifts’. 

The -i' (or [J]) person 2 ending is most often reinforced by a -t that pre¬ 
sumably derives from an enclitic pronoun (tu) and which is sometimes 
extended to person 5. 

Thus at Santa Maria, in the Lower Engadine, person 2 in the a-conjugation 
has an atonic ending [ajt] compared with the person 5 tonic -['at]: 

(9) Idvast (sg.) / lavdl (pi.) ‘you wash’: 

whereas at nearby Remiis the comparable endings are p. 2 [ajt] and p. 5 [' aj- 
vat], where TU and vos elements seem to be incorporated (compared with, for 
instance, Zernez where we find -[aJ]/-[' ajs] and Zuoz, Upper Engadine, where 
the endings are -[aj’t]/-['ajs]). In some Surmeiran varieties the retention of 
etymological [s] rather than the [fj characteristic of /st/ clusters suggests a 
morpheme juncture between the two elements. 

In some Upper-Rhine Surselvan varieties - as at Tavetsch - persons 1 and 3 
are identical. Many others (especially in the north-east of the region) have 
added an -I to person 1 in the present and imperfect indicative, and this 
ending is adopted by the standard. In all Surselvan dialects the use of the 
disjunctive pronoun is required, even when the persons are distinguished 
by inflectional endings: 

(10) Brigels Jeu Idval ‘(I) wash’ / El Idva ‘(he) washes’. 

Sporadic evidence for the suffixed -/ (not found in the earliest texts) dates from 
the beginning of the eighteenth century. It has been suggested that it repre¬ 
sents an enclitic pronoun object (< illum) or that it is introduced by analogy 
with verbs with an / in the stem (e.g. affel < afflo). Although the former sug¬ 
gestion may appear unmotivated, we may compare the ‘pleonastic’ use of a 
feminine (originally neuter?) pronoun la in, for instance, Florentine: 

(11) O che lafai? ‘what are you doing?’ lo laprendo il treno ‘I am taking the train’. 

It is possible that in Surselvan, which no longer uses atonic pronouns, the -/ is a 
relic of a similar former ‘pleonastic’ enclitic pronoun. Another suggestion is 
that the -el element represents a hypercorrect reformation of borrowed Italian 
-o, parallel with e.g. staunchel = Ital. stanco ‘tired’. However, an -el adjectival 
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and nominal suffix (often originally from -i us) is very common in this variety 
and there is no good reason to associate it with Italian -o. More plausible mor¬ 
phologically would be the agglutination to the verb of an obsolete enclitic per¬ 
son 1 pronoun, parallel to disjunctive jeu. There is, however, no phonological 
evidence to support the postulated change [ew] > [ol], even though hardening of 
glides (to velars) is regular in some localities, and the change of [aw] to [al] in 
some contexts is found in a number of North Italian dialects. 

Some marginal Ladin dialects also require use of the disjunctive pronoun 
apparently to distinguish homophonous persons 3 and 6: 

(12) Padola (Comelico) Lujs lava ‘He gets washed’/ Lueris lava ‘They get washed’. 

Dolomite varieties, however, regularly use a subject clitic for persons 2,3 and 
6, whereas p. 1 does not require a clitic: 

(13) Ampezzo lave I te laves (compare Padola lavi llavi)! I lava I i lava‘I I you wash / 
he washes / they wash’. 

In the literary variety of Friuli, where subject clitics are regularly used, 
persons 1 and 3 are identical except in the a-conjugation: 

(14) ila ciol ‘I take, he takes’, but Icianti ‘I sing’ / A ciante ‘He sings’. 

The -/■ person 1 ending often found in North Italian varieties is thought by some 
to derive from an enclitic subject pronoun (cf. Ital. io). 

(e) French 

Modem spoken French seems to be furthest from the Romance archetype, in 
that inflectional person endings are even less consistently used than elsewhere. 
In the present indicative of most verbs persons 1, 2, 3 and 6 have zero mark¬ 
ers. Persons 4 and 5 are, however, differentiated by [5] and [e] endings, and 
sometimes by difference of stem. The graphical form reflects to some extent 
an earlier stage of the language which did apparently differentiate six persons: 

(15) chante (Old French chant), -es, -e, -ons, -ez, -ent. 

Loss of unstressed final vowels and of consonants, completed by the 
beginning of the modern French period (early seventeenth century), led, in 
the standard language, to homonymy of all the stem-stressed forms, and 
by the same period subject pronouns came to be obligatorily used, except 
when there is a noun subject. Dialects sometimes distinguish singular from 
plural verb forms inflectionally, rather than by different pronouns: 
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(16) Je chante / Je chantons ‘I / We sing’. 

Some generative phenologists have maintained that (highly abstract) 
underlying forms still distinguish the different persons in standard French. 
Most scholars agree, however, that the burden of distinguishing the per¬ 
sons has shifted from the verb endings to the proclitic subject pronouns. 
Even when there is a noun subject, popular varieties of French use fairly 
consistently the ‘pleonastic’ subject pronoun: 

(17) Mon pere il chante ‘My father [he] sings’. 

This is similar to what happens in many dialects of north Italy and neigh¬ 
bouring areas, where the pronoun can be virtually obligatory: 

(18) Tessin ul me pa al canto ‘[the] my father [he] sings’. 

The preverbal i in some Indian Ocean creoles can be seen as an extension of 
such constructions, though other origins have been suggested. 

Some would maintain that examples like (17) and (18) involve left disloca¬ 
tion, by which the noun subject is placed in an extraposed topic position, 
to be echoed by the pronoun within the clause proper. Others claim that 
the pronoun is part of the verb inflection. Much depends on judgements 
about intonation patterns, and on the possibility of use of the pronoun to 
echo indefinite noun subjects (like ‘someone’ etc.). In standard French, for 
instance, this does not happen, whereas in many dialectal varieties it does: 

(19) French Quelqu'un (*/7) a demande, Francoprovenijal (Val d’Aosta) Quatsun Vat 
demandd ‘Someone has asked’ (where, admittedly, the / could be a subject or an 
object clitic): French dial. (Picard, Demuin) Chacan i a ker I’indrot add qu’i est 
vnu au monne / standard Chacun (*il) a le cwur Id ou il est vena au monde 
‘Everyone has his heart in the place where he came into the world’. 

Note, however, that a clitic subject apparently echoes personne ‘no-one’ in 
questions in standard French: 

(20) Personne ne vient-iP. ‘Is nobody coming?’ 

In non-standard French, too, clitic copying with indefinites is also not 
unknown (see the film title Tout le monde il est beau, tout le monde il est 
gentil ‘Everybody [he]’s good-looking, everybody [hej’s nice’). 

In both Erench and North Italian varieties, unlike standard Italian, a 
‘dummy’ (expletive) pronoun is used in impersonals, where no logical sub¬ 
ject is involved: 
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(21) French IIpleut, itfaut I Ital. Piove, bisogna ‘It is raining’ / ‘it is necessary that.. 

In French, even with the clitic subject, person 3 is identical with person 6, 
except when the pronoun is followed by a vocalic initial: 

(22) [i'fat] Il(s) chante{nt) ‘He sings, they sing’, but [i' lem] ilaime ‘he loves’ / [i ’zem] 
Us airnent ‘they love’. 


(f) Anaphoric and deictic pronouns 

Persons 3 and 6 are, as we have seen, homophonous in some other 
Romance languages, and, normally, these persons would, in the absence of 
a lexical subject, call for a ‘substitute’ anaphoric subject pronoun. 
However, in those Romance languages in which use of the subject pronoun 
is not obligatory it is the discourse or deictic pronouns - first and second 
persons - that are most frequently used, and not the third-person pro¬ 
nouns. This usage is presumably pragmatically motivated, in that the dis¬ 
course pronouns closely implicate the interlocutors. 


(g) First and second persons plural 

Yet, morphologically, it is persons 4 and 5 that appear to be most often 
clearly differentiated, not only by distinctive endings, but also often by 
stem changes that historically owe their origin to the Latin accentuation 
pattern (see 7.3): 

(23) French il vient I nous venons / vous venez, Ital. viene / veniamo / venite. Span. 
viene / veninios I venis ‘he comes / we come / you come’. 

Persons 4 and 5 behave in unexpected ways in several languages. We have 
already mentioned that Italian has generalized the ending -iamo in all con¬ 
jugations. French, too, has adopted a uniform ending for each of these 
persons: -ons is sometimes said to come from sumus (French sommes ‘(we) 
are’) or, perhaps, from a deviant development of -AMUS (Old French -ens), 
while -ez is a regular outcome of -axis. 

In the Spanish and Portuguese standard languages persons 4 and 5 of 
the present indicative are the only finite forms that distinguish three conju¬ 
gations rather than two: 

(24) p.4: -amos, -emos, -iinos; p.5: Span, -dis, -eis, -is I Ptg. -ais, -eis, -is. 
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One may well wonder what it is about these two persons that calls for spe¬ 
cial characterization. Perhaps it is something to do with their semantic com¬ 
plexity - ‘we’ can indicate the inclusive ‘I and you’ or the exclusive ‘I + he or 
they’ (a distinction not grammaticalized in Romance), while ‘you’, originally 
plural, can often be used as a polite form of address to a single person. 

In several languages, though, there seems to be a tendency to avoid use of the 
morphologically marked persons 4 and 5 forms (see below). 


(i) Honorifics 

Some languages prefer a person 3 honorific form, with third person verb 
agreement, to the polite use of the person 5 form in addressing a single 
individual: 

(25) Span, usted (originally vuesira merced ‘your grace’) and also e/, ella in the older 
language and in dialectal use; Ital. dial. (Genoa) vosd (from vostra signoria 
‘your lordship’); Ptg. o senhor ‘the lord’, voce (from vosse merce ‘your grace’), 

Rumanian Dumneatd {Mata for short) or less familiar Dumneavoastrd are 
used with person 2 and 5 verb forms respectively. 

Italian voi (person 5 pronoun) and Lei (feminine person 3 pronoun) are 
today often regarded as regional (South and North) variants of the hon¬ 
orific pronoun, but they have apparently always alternated, without clear 
pragmatic differentiation. 

Interestingly enough, some Louisiana (cajun) French varieties differentiate the 
polite vous person from the familiar plural vous autres ‘you others’ (see 2.2): 

(26) Vous chante I Vous-autres chantez or vous-autres fa chante ‘You sing’ (sg./pl.). 

Another isolated related, and now virtually obsolete, American French dialect, 
that of Old Mines, Missouri, does not retain even traces of the -ez person 5 
ending, except in a few fossilized forms. A distinction in person 5 pronouns 
similar to that of cajun is found in Friulian, the polite vo form being used in 
addressing a friend or relative, and voaltris as a plural. However, third person 
forms are used to address strangers or superiors. 

(ii) Voseo 

Some American Spanish varieties apparently make no use at all of the per¬ 
son 5 forms, employing the subject pronoun vos apparently with person 2 
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verb forms (a phenomenon known as voseo) to address single individuals. 
However, the stress pattern of some of these verb forms, as in the Plata 
region of Argentina, suggests that there may be person 5 rather than person 
2 agreement: 

(27) Vos cantos (< cantdis? rather than cantos) ‘You sing’. 

In Argentina vos is apparently used with person 2 agreement in most tenses: 

(28) VOS cantards, vos has cantado, vos cantatas 

but in the preterite popular usage adds a -s marker to the -dste person 2 
verb form (in place of the standard person 5 -asteis form): 

(29) Vos cantastes (popular) / cantaste ‘You sang’. 

Elsewhere (for instance, in Chile) person 5 verb forms are certainly used in 
voseo. 

Perhaps significantly. In many of the voseo dialects syllable-final s is phoneti¬ 
cally realised as a spirant, or has disappeared altogether (see 6.2). For the plur¬ 
al, ustedes (with p. 6 agreement) is used in voseo dialects. In West Andalusia the 
same form, a honorific, is used with a person 5 agreement: 

(30) Ustedes cantdis ‘You (pi.) sing’. 

(iii) Person 4 

The first person plural inflected form is apparently avoided in popular 
varieties of several languages. In colloquial French, for instance: 

(31) On chante ‘One sings’ is frequently substituted for nous chantons; nous on va 
‘we are going’. 

The on form (see 6.5) seems to be the only colloquial person 4 form in 
most American varieties. 

In colloquial Tuscan varieties of Italian the use of a person 3 pronomi¬ 
nal (reflexive) form has virtually replaced the person 4 form: 

(32) Not si canta for Cantiamo ‘We sing’. 

Similar forms are found elsewhere in Italy (especially in the north), even 
where the inflectional ending is well preserved. 

In some popular Brazilian Portuguese varieties the person 4 -mos end¬ 
ing is virtually unknown, the person being marked by an obligatory, pre¬ 
posed pronoun: 
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(33) Nos deve for devemos ‘We ought’. 

Even where the characteristic -mus person 4 ending appears to be intact, 
there is sometimes confusion with the nos ‘we’ pronoun. In some lan¬ 
guages such confusion is undetectable as it is overlaid by other phonologi¬ 
cal changes, but in parts of south-west France, north Spain and central 
Italy it looks as if the inflectional and pronominal morphemes have tended 
to fuse. In the pronoun not infrequently the initial phoneme appears as Iml 
(a phenomenon found also in south-east France). 

Even more often the inflectional ending could conceivably be a reflex of 
the NOS pronoun, rather than of the -mus ending. This is particularly so in 
Spain (examples from Leon, Aragon and Asturias have been cited, most 
frequently in the imperfect indicative and the present subjunctive) and in 
American Spanish. The -nos forms (as distinct from the -mos forms) may 
be stressed on the stem, not on the thematic vowel: 

(34) North American (chicano) Span. Trabdjenos / standard Trabajemos ‘Let’s 
work’. 

This stress patterning suggests that the ending may have been reanalysed 
as trabdje (person 1 or person 3) + nos. 

Old Tuscan -no person 4 endings were frequent from the thirteenth cen¬ 
tury: analogy with person 6 is the favoured explanation. For instance, in 
the attested medieval form: 

(35) Penseno ‘We think’ 

where there is no stress marking in the graphy, it is possible stress was on 
the stem as in person 6 (penseno). But similar forms survive in varieties of 
modern Tuscany, with the stress on the ending. It is possible that here we 
have enclisis of the no ‘we’ pronoun to an apocopated person 4 form. 
Thus: 

(36) faciem(o) + no> facieno = lla\.facciamo ‘we do’ (compare R-R (Friuli) mangin 
= Ital. mangiamo ‘we eat’). 

We note, however, that in Liguria persons 4 and 6 of the imperfect indica¬ 
tive can be identical: 

(37) Mingdven = standard MangiavdmolMangidrano ‘We/they were eating’. 

Shift of stress in the inflected person 4 form may also occur in the North 
Italian Cremona dialect: 
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(38) Kantum!Kdntum ‘We sing’. 

The stem-stressed form seems to be associated with: 

(39) Um kdnta ‘One sings, we sing’. 

In French there may once have been similar associations: see example (31) 
above. The interaction of person endings and person pronouns, as shown 
by analogical reformation of one or the other, by reanalysis of the endings 
as pronouns, or the fusion of pronouns to the verb as inflections, occurs 
sporadically throughout the Romance area, more often in non-standard 
varieties than in the standard languages. 


(h) Person markers and subject pronouns 

In none of the creoles are there person inflections, but an invariable lexical 
verb form is obligatorily preceded by a disjunctive non-case-marked per¬ 
sonal pronoun, except when a lexical subject is present. Compared with our 
‘archetypical’ Romance language, the typological difference appears radi¬ 
cal. 

We have seen, however, that within the Romance languages there is a 
shading which, at the outer limit, no longer presents sharp contrast with 
the creoles. Some would place the dividing line between those languages 
that still rely primarily on inflectional endings to mark the person of the 
verb (identified broadly with the null subject or pro-drop languages of cur¬ 
rent generative studies), and those that make use of preposed subject pro¬ 
nouns: French, Northern Italian, the Spanish of San Domingo and the 
Portuguese of Brazil would then fall on the side of the creoles. 

The hypothesis of a difference in parametrical setting that divides the 
Romance languages into two groups on this basis has encountered many 
empirical problems, and has led to refinements of the hypothesis that 
sometimes appear to lack close direct links with the data. For instance, a 
distinction is made between ‘phonological clitics’ in North Italian vari¬ 
eties, where the pronominal form is treated as part of the inflection, and 
‘syntactic clitics’ in French, where the pronoun is assumed to occupy a 
syntactic slot of its own. If we take into account non-standard usage, this 
distinction is not so clear-cut. 

Here we should return to the earlier remark that the modern Romance lan¬ 
guages may be moving to verb-first (VI) ordering. If the subject clitic does 
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indeed become a verb inflection then in many cases a change from SV to VI 
ordering has been effected. 

Undoubtedly phonetic attrition of personal inflections favours increase in 
use of pronouns. But the direction of causation is not uncontroversial; 
indeed historical evidence seems to indicate no direct chronological correla¬ 
tion between loss of inflections and obligatory use of overt pronouns. After 
all, as we have seen, some languages have buttressed their doomed inherited 
inflections with postposed vestigial pronouns, and some use clitic or non¬ 
clitic pronouns obligatorily, even though most inflectional endings are kept 
distinct. As an example of belt-and-braces differentiation of persons, we can 
cite a Lombardy paradigm, which distinguishes person endings, in which at 
least one pronoun is obligatory, either the disjunctive (mi canti), or the clitic 
(a canti), but often both are used {mi a canti). The proclitic pronoun, in some 
varieties, differentiates persons 2,3 and 6, and occasionally person 1. 

Lombard person markers 

mi a (Imi) kanti; ‘me he (/I) I-sing’; 
ti ty (la) kdntaf, ‘you you (/he) you-sing’; 
ty at (Id) kdnta\ ‘him he sings’; 
le la kdnta' ‘her she sings’; 

ny (m) a kantom ‘we he we-sing’/ ny (m) am kantawe = ‘we one sings’; 
vy / vialtri a kantd(f) ‘you/you-all he you-sing’; 
lor i (la) kdnta ‘them they (/he) sings’. 

Indeed, here, the hypercharacterization can be so marked that one col¬ 
league has called it ‘belt, braces and hands’. 

Other North Italian varieties also make similar use of pronouns though not 
always so consistently. In most Veneto varieties a clitic pronoun is obligatory 
with only persons 3 and 6, which have identical verb endings: 

(40) ell I crede ‘he believes / they believe’. 

In Venice clitic ti is obligatory also with person 2; however, the person 3 clitic is 
not used with ‘impersonals’, or ‘indefinites’, nor, curiously, when the lexical 
subject is postposed to the verb (though note that in this example there is an 
unaccusative verb): 

(41) Toni el ze ri\d I Ze riva Toni ‘Tony has arrived’. 

It has been argued that in this variety the person 3 clitic is an argumental ele¬ 
ment which cannot be associated with the expletive null element that is 
assumed to occupy subject position (e.g. in unaccusatives). In varieties of 
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Central Veneto an invariable particle a accompanies persons 2 and 3 clitics, 
apparently always verb-phrase initially: 

(42) A te ve via ‘You are going away’, A veto via'! ‘Are you going away?’, A no te 
parli mai ‘You never speak’. 

It is also used with ‘impersonals’, like weather verbs; whether it is better viewed 
as a pronoun or as a verb-phrase marker is debatable: 

(43) A nevega ‘It is snowing’. 

In the dialect of Genoa, like that of Venice, a clitic pronoun is obligatory with 
persons 2,3 and 6, but also with the polite form - see example (25) above - used 
with persons 3 and 6 agreement: 

(44) [(vu ’Ja) fa sa] = Ital. Lei sa ‘you know’. 


Pro-drop in French 

The problems posed by the history of French person markers has in recent 
times sparked off copious research activity, as the transition from pro¬ 
drop to non-pro-drop was apparently made in fairly recent times (complet¬ 
ed by the seventeenth century). For French, unlike most North Italian 
varieties, there is ample documentation of all kinds and for most periods. 
The main facts of the case have long been known: in Old French texts sub¬ 
ject pronouns are not used in contexts in which they would be obligatory 
in modern French. Old French texts also tolerate more variation in word 
order and show a richer verbal morphology than modern French, and so it 
has frequently been assumed that Old French was more ‘prototypically’ 
Romance than the modem language. 

The Old French of the early texts has, however, apparently non- 
Romance syntactic features that some have linked to putative Germanic 
influence (presumably owing to the prestige of the Franks, who constitut¬ 
ed the ruling class in north Gaul after the disintegration of the Western 
Romance Empire). These features include the apparent predominance of 
the so-called verb-second (V2) word order that is usually assumed to be a 
defining characteristic of Germanic languages. The link between V2 and 
non-use of pronoun subjects in Old French texts has long been recognized. 
Where we would expect a subject pronoun to follow the verb, displaced by 
another element preceding the verb in the clause, most frequently no overt 
pronoun is used (or ‘the pronorm is omitted’). 
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In modem French V2 ordering is no longer extant (except in some fossils - 
like ainsi soit-il ‘so be it’). Attempts to trace in texts a direct link between loss 
of V2 and obligatory use of pronouns have not been completely successful. 
That there is no simple causal connection is suggested by comparison with 
those Surselvan Rhaeto-Romance varieties that have V2 and a rich verb mor¬ 
phology, but also obligatorily use disjunctive case-marked pronouns: 

(45) Jeu cumprel, ti cumpras, el cumpra ‘I, you buy, he buys’. 


Sociolinguistic factors 

One thing that has to be borne in mind is that modem French is highly stan¬ 
dardized. Seventeenth-century linguistic arbiters decreed that variants that 
permitted non-use of subject pronouns were unacceptable and archaic, on 
the grounds that they were potentially ambiguous. Texts before this time 
show more variation, but documentation of truly colloquial usage is hard to 
come by. So we can only extrapolate from incomplete evidence, testing 
hypotheses that can be formulated within the framework of current theories. 

Standardization in this instance, as in others, had the effect of distanc¬ 
ing French from the inner circle of the Romance ‘club’. Seventeenth-centu¬ 
ry fashionable society in France swept aside tradition and fostered radical 
modernity. The ideal of clarity and elegance in language use eschewed 
ellipsis and favoured overt marking of sentence constituents. 

French subject pronouns by the seventeenth-century were always pro¬ 
clitic to the verb and could not receive stress. Stressed disjunctive forms 
that were originally non-nominative (like moi, toi < me, te) were used for 
emphasis, alongside the clitics: 

(46) Moi, je chante, Toi, tu chante ‘/ sing, you sing’. 

In the modern standard, exceptionally, it is still possible to use the third 
person disjunctive pronoun without a clitic, usually with adversative 
import: 

(47) Ltii vient et non elle ‘He’s coming, and not her’. 

It is the disjunctive forms, not the clitics, that yielded the only personal 
pronouns of French creoles. There is a closer resemblance in this respect 
between French and North Italian varieties, where the subject clitic can be 
viewed as an inflectional element, than between French and the creoles. 
Thus we can maintain that in the use of person markers most creoles 
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stand apart from the Romance languages proper, though Surselvan 
Rhaeto-Romance and non-standard Brazilian Portuguese have creole-like 
properties. 

1.3 NOUN GENDER 

In the Romance languages proper, as distinct from the creoles, nouns fall 
into two gender classes, known as masculine (m.) and feminine (f) because 
there is a reasonably good, though far from perfect, correlation with sex 
differences where referents are animate beings. Unlike Latin, there is no 
third neuter (n.) gender, even though, as we shall see, there are forms that 
can be regarded as resembling a neuter. Within the Indo-European family 
the historical collapse of a three-gender system into one distinguishing 
animate from neuter is frequent, but the maintenance of a masculine/femi¬ 
nine distinction, with the concomitant loss of the neuter, seems to be 
unique to Romance (though it is found outside Indo-European). 

Romance gender differences are marked most consistently by agreement 
of determiners and adjectives, though, as we shall see, the phonological 
shape of the noun can serve as a good, though not infallible, signal to its 
gender. Gender of non-animates appears to be fairly random though some 
semantic patterns can be discerned, usually reproducing those of Latin: 
for instance, ‘abstract nouns of quality’ - noun derivatives of qualitative 
adjectives - are normally feminine. 

The Latin A-class (first declension) and C-class (second declension) nouns 
were overwhelmingly feminine and masculine respectively, and remain so in 
Romance. Where the final vowels have survived (as in Spanish and Italian) 
-a is still seen as the feminine marker, -o as the masculine, mainly because 
those adjectives that distinguish gender make use of these inherited marks. 
Even in French, where all unstressed final vowels eventually were lost, the -e 
that continued Latin -A is seen as a feminine marker; nowadays, in the stan¬ 
dard language it is a graphical marker usually indicating that the preceding 
graphical consonant is pronounced. In some varieties, on the other hand, it 
is heard as a schwa [o], and in some dialects (e.g. in Normandy) it changed 
the timbre or length of a preceding vowel before being lost. 

The first- and second-declension word markers in Latin are not, how¬ 
ever, totally reliable indicators of gender: in particular some -A forms are 
masculine and designate male human beings: 

(48) AGRICOLA ‘farmer’, poeta ‘poet’. 
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Romance conserves some apparently anomalous cases: Italian poeta has 
the regular masculine plural form poeti, whereas Spanish poetas looks 
morphologically like a feminine plural. 

(a) Latin neuters 

The Latin third gender - the neuter - has no direct reflex in nouns in the 
Romance languages (but see below). Neuters in the u-class normally 
become masculine: 

(49) PRATUM ‘meadow’: Frenchpre, \Xa.\.prato, Span., VXg.prado, Occ., Cat./irar; 
OVUM ‘egg’: French auf Span, huevo, Ptg. ovo, Occ. uou. Cat. ou (for Rum./Ital. 
m. ouluovo, f. plural oualuova - see below). 

Some third-declension neuter nouns with a u in the final syllable are also 
incorporated into the same class: 

(50) CAPUT ‘head’: French chef, Ital. capo. Span., Ptg. cabo; Occ., Cat. cap m,, 
capete f pi.; pectus ‘chest’: Old French pi:, Ital. petto. Span, pecho, Ptg. peito, 
Occ. peitz. Cat. pit (Rum. piept ‘breast of coat’ is masculine, but see below for 
piept ‘chest’). 

In some cases (but see below) the plural second-declension neuter form in 
-A is reanalysed as a feminine singular (with originally collective import): 

(51) OVA ‘eggs’ (/ovum): Span, hueva ‘roe’ (Ihuevo); fata ‘fates’: French fee, Ital. 
fata. Span, hada, Ptg., Occ., Cal.,fada ‘fairy’ (see Span, hado ‘fate’, Vtg.fado ‘a 
kind of song, dirge’ < fatum); folia ‘leaves’: French feuille (Old French also 
fueil m.), IXaX.foglia (Ifoglio ‘sheet of paper’). Span, hoja, Ptg. folha, Occ. folha 
(Jfolh), Cat. fulla (./full), Rum./oa/e; gaudia ‘joys’: French joie (Old French 
also Joi m.), Ital. gioia ‘joy, gem’, Span.yova ‘gem’, etc. seem to be borrowed 
from French (compare Span, gozo ‘delight’). 


(b) Fluctuating gender 

Most descendants of Latin third declension nouns lack an overt mark of 
gender. Not surprisingly, Romance languages frequently hesitate about 
allotting them to one class or another, even when they were not originally 
neuter nouns: 

(52) MAR, MARIS n. ‘sea’: French mer f, Ital mare m.. Span., Ptg. mar m. 

(f in Old Spanish and in modem dialects as well as in some idioms e.g. en alta mar 
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‘on the high seas’), Occ. mar m. or f.. Cat. mar m. ‘body of water’, f. in other uses, 
Rum. mare f.; LAC, LAcns n. ‘milk’: French lait m., Ital. latte m., Sard latte f., Sp. 
leche f., Ptg. kite m., Occ. lack m.. Cat. llet f. (for Rum. lapte, see below). 

SANGUEN, SANGUINIS n. / SANGUIS, SANGUINIS m.’blood’: m. nearly everywhere 
except Occ. sang m. or f, Cat. sang t and Span, sangre f (for Rum. singe, see 
below). MEL, MELLis n. ‘honey’: f. in Rum. (iniere), many Ital. and R-R dialects. 
Cat. (met) and Span, (inlet), but m. elsewhere; pons, pontis m. ‘bridge’: French 
pont, Ital. ponte m., but Span, puente, Ptg. ponte f.; valles, vallis f ‘valley’: Ital. 
valle. Rum. vale f., French val. Span, valle m. 

It is not unknown for both masculine and feminine reflexes of Latin third- 
declension nouns to be found in the same language: 

(53) Span, la /rente, Ital. la fronte ‘forehead’ / Span, el/rente, Ital. il/ronte ‘front 
(military, weather)’ (frons, frontis f); Ital. la/ine ‘end’ / ilfine ‘purpose’ (finis 
m. sometimes f.); French de vieilles gens (f.) ‘old people’ / des gens ennuyeux (m.) 
‘boring people’ (gens, gentis f.). 

Latin fourth-declension nouns in -u- are usually treated as masculine, 
whatever their original gender: 

(54) DOMUS f ‘home’: Ital. duomo ‘cathedral’; porticos f. ‘porch’: French porche, 
lta\. portico; cornu n. ‘horn’: French cor, Ital. corno. Span, cuerno, Ptg. corno 
(but NURUS ‘daughter-in-law’ was early attested in the form nora). 

In the (Southern Italian) dialects of Campania distinctive traces of the fourth 
declension survive: the same form is used for the feminine singular and plural 
in these words: 

(55) ACUS, ACUS, f.‘needle’ > ago (Ital. m. agotaghi); ficus, ficus, f. ‘flg-tree’yico 
(Ital. m. ficolfichi). 

MANUS ‘hand’ (a word learned in early infancy) has, exceptionally, retained 
its feminine gender nearly everywhere, even where it has a masculine-type 
final vowel: 

(56) Ital. inano, Sard manu. Span, mano, Ptg. mao. 

In the Rhaeto-Romance of the Engadine, however, maun, man is masculine - a 
gender also attested in Old French and Old Occitan. In Rumanian and some 
Italian dialects the form of the word has changed to match its gender: 

(57) mind, mana. 

It appears that the plural forms of this word succumb more readily to such 
analogical change in dialectal usage. Nearly everywhere diminutive and other 
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derivatives of manus have overtly feminine endings (though in the Engadine 
maunin is feminine and manut masculine): 

(58) Ital. manina, mamccia. Span, manita, Ptg. maozinha. Rum. minutp 

Not infrequently, any gender fluctuations that affected nouns in the 
Middle Ages were eventually resolved in favour of the Latin tradition, at 
the period of codification of the languages by normative grammarians, 
who were keen that the prestige of Latin should rub off on to the newly 
elevated ‘vulgar’ varieties. This is particularly true of French, where ero¬ 
sion of final vowels has left nouns with no overt gender marker: 

(59) amour f. -> m. ‘love’ (AMOR, amoris m.), erreur f m. ‘error’ (error, erroris 
m.), espace f. -> m. ‘space’ (spatium n.), image m. (originally non-nominative) 
-> f ‘image’ (imago, imagine f). 


(c) ‘Neuter’ gender in Romance? 

For some languages a third gender is sometimes postulated and linked 
with the Latin neuter. I shall examine these so-called ‘neuters’ and seek to 
show that they do not in Romance constitute a separate gender class. 

(i) Spanish 

In standard Spanish nominalized masculine adjectival forms are differen¬ 
tiated into animates and abstracts apparently by the choice of the form of 
the definite article (el m., lo n.): 

(60) el bueno ‘the good man’ / h bueno ‘the good’. 

Furthermore the /o-neuter can be used, though very rarely, with nouns to 
express the inherent quality of the objects referred to: 

(61) lo mujer ‘femininity’ / la mujer ‘the woman’; lo rey ‘kingliness’ / el rey ‘the king’. 

Lo can more readily be used with plural or feminine ‘gradable’ adjectives: 

(62) a. En lo valientes y sufridos ningun 

in the (n.) valiant and enduring (m. pi.) no 

soldado aventaja a los espaholes. 
soldier surpasses to the Spanish 


‘There are no better soldiers than Spaniards in respect of valour and suffering 
hardships.’ 
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b. Habia olvidado to guapa que eres 

I-had forgotten the(n.) handsome(f.) thatyou-are 
‘I had forgotten how pretty you are.’ 

A similar usage is occasionally found in Portuguese, where there is no for 
mal distinction between ‘masculine’ and ‘neuter’; a translation of (71b) is: 

(63) Tinha esquecido o bonita que es 
I-had forgotten the (m.) pretty (f) that you-are 

This may, however, be regarded as more an imitation of Spanish than as 
authentic Portuguese. 

Lo in these uses may be more readily understood as a pronoun than as an 
article: we can compare, for instance, Lo que eres guapa! ‘How handsome 
you are!’ (literally ‘That which you are handsome’). For the close relation¬ 
ship of determiners and pronouns in Romance see 3.3. and 7.5. The dis¬ 
tinction between determiner and pronoun accompanying an adjective is 
more obvious in Spanish with the indefinite form. In this case the pronom¬ 
inal (uno) can be understood as a numeral: 

(64) En la playa vi a cuatro chicos morenosy unluno rubio ‘On the beach I saw four 
dark boys and a/one fair [one]’. 

A ‘definite’ pronoun in such a context would be understood as closer to a 
demonstrative (‘that’) (though without the localization implied by the 
demonstrative). 

For the use of the definite pronoun/article with possessives see 6.5. 

The lo pronoun/determiner can also be used to nominalize other syntag¬ 
mata; in these cases it looks even more akin to a pronoun than a determiner: 

(65) lo de ayer (literally,‘the [thing] of yesterday’) ‘what happened yesterday’; No me 
di cuenta de lo rdpidamente que trascurrian las boras ‘I didn’t realize how (the 
n.) quickly the hours passed’. 

The Spanish ‘neuter’ is clearly not functionally derived from the Latin 
neuter, even though morphologically lo may well continue the neuter 
demonstrative form illud. It can be seen today as a pronoun with [-specif¬ 
ic] semantic import. It is difficult to discern how far the Spanish ‘neuter’ 
function was an ancient popular development, as, in Old Spanish, the mas¬ 
culine definite article el, rather than lo, could be used with substantivized 
adjectives - though it is true that lo was used with possessives (lo mio etc.). 
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In modern times (since the eighteenth century), some writers have used the 
el form with adjectives, though this use may be regarded as rather affected: 

(66) Infelices cuya existencia se reduce al mero necesario ‘Unhappy [people] whose 
existence is reduced to the (m.) mere necessary’. 

For the morphological relationship between el and lo see 7.5. 

(ii) Asturias 

In Central Asturian dialects of Spain, alongside the ‘abstract neuter’ is 
found another phenomenon, which is often described as a neutro de mate¬ 
ria or a ‘collective neuter’. In Asturian there is no special definite article 
form, but in some varieties nouns with non-count meaning can have a final 
-o distinct from the normal masculine -u, sometimes triggering metaphony 
(4.2). 

(67) Ten elpelo (n.) rojo La kayu i unpilu (m.). 

he-has the hair red it fell to-him a hair 

‘He has red hair.’ ‘One of his hairs fell out.’ 

Ye defierro (n.) da me isi fiirru (m.) 

‘It is [made] of iron.’ ‘Give me that iron.’ 

Usually, however, mass-noun status is indicated only by the neuter form of 
the (dis)agreeing adjective: 

(68) Apurreme la cebolla blanca (f) ‘Peel me the white onion’ / Semo la cebolla 
bianco (n.) ‘He sowed the white [species ol] onion’; el pie friu (m.) ‘the cold 
foot’ / elarroz frio (n.) ‘(the) cold rice’; la casafria (f) ‘the cold house’ / elagua 
frio (m.) ‘(the) cold water’; el ome buenu (m.) ‘the good man’ / la mujer buena 
(f.) ‘the good woman’ / la genie bueno (n.) ‘(the) good folk’. 

It has been suggested that the -u form derives from Latin -us (m. nom. sg.), 
while the -o ‘collective’ comes from neuter -UM. More plausible, phonolog- 
ically and semantically, is the suggestion that the -o derives originally from 
an ablative, or a genitive-dative, inflectional ending used with partitive 
sense, and that the ‘disagreement’ has a function similar to that of the par¬ 
titive article in French. The distinction would therefore be one of case, 
marked only on the adjective in some varieties, rather than of gender. 
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(iii) Southern Italy 

A similar phenomenon is also found in parts of Abruzzo, in and near Bari and 
in Lucania (Basilicata). It takes various phonological guises, including a word- 
final vowel (-o/-m) distinction and/or umlauted vocalism (in the -u forms): 

(69) lo ferro (n.) ‘(the) iron’, lopescio (n.) ‘(the) fish’ / lupjettu (m.) ‘the chest’, 
lupesciu (m.) ‘the fish’. 

Here the distinction, as in Asturias, seems to be one of case, rather than of 
gender. 

In some dialects only the form of the ‘definite article’ marks the non-count sta¬ 
tus of the noun. The form of these articles varies from dialect to dialect but 
contrasts are regularly made: 

(70) o latte (n.) ‘(the) milk’ / u pratu (m.) ‘the meadow’; lu pan (n.) ‘(the) bread’ / ju 
kane (m.) ‘the dog’; le sangue (n.) ‘(the) blood’ / i pete (m.) ‘the foot’; le pesce 
(n.) ‘(the) fish’ / la pesce (f) ‘the fish’. 

Sometimes the ‘neuter’ article shows that the noun has an abstract mean¬ 
ing, while the masculine indicates a more concrete sense: 

(71) Abruzzo le ftiokeia.) ‘(the) heat’/Zu fuoke(m.) ‘the fire’; lekorein.) ‘the heart’ 
(figurative, as in le kore de la none ‘the dead of the night’) / lu kore (m.) ‘the 
heart’ (in other senses). 

The Abruzzo neuter article is not only used with mass nouns, but also, as 

in Spanish, with substantivized adjectives, infinitives and past participles: 

(72) le bielle ‘the beautiful’, [b ma'jijia] = Ital. il mangiare ‘eating, food’, 

[la ' skretta] = Ital. lo scritto ‘what has been written’. 

The ‘neuter’ article is also used in Western Abruzzo with forms deriving 
from Latin neuter plurals: 

(73) le prate < prata ‘meadows’, le corpore < corpora ‘bodies’ 
and with newer formations of the same type: 

(74) le capere < cap(ut) -t- -ORA ‘heads’. 

In Western Abruzzo, too, an ‘indefinite neuter’ article ([na]/[nu] m., [na] f.) is 
also found: it is used with a singular mass noun, or with a masculine plural to 
indicate excellence (‘quite something’): 

(75) [na'vina] ‘high-quality wine’/[nu'vina] ‘a wine’; [naka'vella] ‘thoroughbred 
horses’ / [ika'vella] ‘the horses’. 
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In several Southern Italian varieties the initial consonant of mass nouns shows 
a ‘syntactic doubling’ (see 6.3), not found in comparable masculine nouns, sug¬ 
gesting that the preceding definite article, although phonetically identical with 
the masculine form, is phonologically distinct: 

(76) luppane (n.) ‘(the) bread’ / hi liettu (m.) ‘the bed’; u ssale (n.) ‘(the) salt’ / u nase 
(m.) ‘the nose’. 

The distinctive marking of the mass noun is often marginal and fragile. In 

some parts of Lucania, for example, a few nouns are marked by ‘neuter’ 

article ru: 

(77) ru ssale ‘salt’, ru ppane ‘bread’ 

but neighbouring varieties incorporate them into the masculine gender class: 

(78) uppane ‘bread’. 


Are the Asturian and Abruzzan ‘neuters’ related? 

It has been suggested that the South Italian and the Asturian ‘neuters’ are 
directly related historically (perhaps through colonization of parts of 
Hispania by Oscans, during the Imperial period) and that they are relics of 
a once more widespread phenomenon. In Italy, however, agreement phe¬ 
nomena may not be involved, in the way they are in Asturias. Admittedly, 
reduction of final vowels, in many of the Italian varieties, means that 
adjectival agreement is not shown overtly. However, one might expect evi¬ 
dence of metaphonic (‘umlauted’, see 3.2) stem-vowel differences in some 
varieties (where, for instance, a distinction is made between nire (m.) and 
nere (f) ‘black’). The syntactic evidence available about these dialects is so 
patchy, though, that we do not at present know for certain what happens. 

One bit of negative evidence can be found in the nominalized adjectival forms 
at Sulmona (Abruzzo): 

(79) le nire (n.) ‘(the) blackness’ / lu nire (m.) ‘the black man’ 

where, unexpectedly, the ‘neuter’ form is umlauted; however, it may be a lexi- 
calized fossil. Another apparently contradictory bit of evidence comes from 
Servigliana (Ascoli Piceno), in the Marche region, where the ‘neuter’ has an -o 
ending (lo vi ‘wine’), compared with the masculine -u (lu patru ‘the father’). 

Here there is, in the compound perfect, agreement of the past participle with 
the masculine subject of the ‘unaccusative’ (see page 19): 
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(80) Issu e vinutu ‘He has [is] come’. 

Conversely the non-agreeing past participle in transitive verbs; 

(81) Issu ha magnato‘He has eaten’ 

seems to have a ‘neuter’ (or default?) gender mark. 

The similarities and differences between the Asturian and Southern Italian 
cases suggest a marginal and independent survival of an inherited case dis¬ 
tinction, functioning as a mass/count noun distinction, rather than an 
innovation within the gender system. 

For a possible parallel in Alpine Romance, see 3.3. 


(iv) Italian 

Yet another type of phenomenon that is often associated with the Latin 
neuter is that of ambigenous nouns in standard Italian and Rumanian; 
these have masculine gender in the singular and feminine in the plural. In 
other standard languages a few marginal cases of ambigeneity are found: 

(82) Span, el arte gdtico (m.) ‘Gothic art’ (but el arte poetica (f.) ‘Ars Poetica’) / las 
bellas artes (f.) ‘fine arts’ (ars, artis f,); French un delice (m.) / les delices (f.) 
(deliciae f. pi. ‘delights’). 

In Italian more, but still comparatively few, nouns - mainly designating 
parts of the body - are at issue. The feminine plural forms in -a (apparent¬ 
ly originating as the Latin second-declension neuter nominative- 
accusative plural ending -A) often have collective or dual purport: 

(83) il braccio ‘the arm’ / le braccia ‘the arms’ (bracchium n.); il labbro ‘the lip’ / le 
labbra ‘the lips (of human)’ / i labbri ‘the lips (figurative), brims, rims’, also in the 
anatomical sense for ‘closed lips’ (labrum n.); ilfrutto ‘the fruit’ / le frutta, i frutti 
(more frequent) ‘fruits’, also la frutta ‘fruit’ (collectively) with the plural le frutte 
or (more archaic) le frutta (see Span, elfruto ‘fruit’ [growing] / la fruta ‘fruit’ 
[gathered]) (fructus m. 4th declension); I’uovo ‘the egg’/ le uova ‘[the] eggs’ 
(OVUM n.). 

In Swiss Rhaeto-Romance there are traces of a similar distinction: 

(84) Surmeiran igl meil ‘the apple’ / la meila ‘[the] apples’; Engadine il daint ‘the fin¬ 
ger' / la dainta ‘[the] fingers’; il bosca ‘tree’ / la bosca ‘the wood’. 

One indication of the marginal character of ambigenous nouns in modern 
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Italian is the degree of hesitation about gender agreement in co-referent 
pronouns, as in the following literary examples: 

(85) a. dodici uova avvoltate ciascuna in m foglio 

twelve eggs (f.pl.) wrapped (f. pi.) each (f. sg.) in a sheet (of paper) 

b. come insomma se le dita dei piedi non li 

as in-short if the fingers (f pi.) of the feet(m.pl.) not them 
(m. pi.) 

aressi. 

I-had. 

‘as if I hadn’t any toes (? feet)’. 

Some dialects appear to treat the dual form as a masculine singular (as in 
Tuscan) or as a masculine plural (as in Sicilian), while elsewhere (as in some 
South Central dialects and in Old Surselvan Rhaeto-Romance) adjectival 
agreement suggests feminine singular status. 

In older Italian more ambigenous words were current: 

(86) letta ‘beds’ / modern letti (lectus m.). 

There was also a greater number of words with reflexes of the Latin (unac¬ 
cented) third-declension neuter plural ending - 6ra: 

(87) corpora ‘bodies’ / modem corpi (corpus, CORPORA n.). 

Some Central and Southern dialects still make greater use of such forms, 
which are treated as feminine plurals. Although -e remains the most fre¬ 
quent feminine plural marker, it is rivalled in some areas (e.g. near Salento) 
by -uri etc.: 

(88) Abbruzzo tempere ‘times’ (tempus, tempora n.), tettere ‘roofs’ (tectum n.); 
Sicily corpure ‘bodies’; Salento manure ‘hands’ (manus f), akure ‘needles’ 

(acus f). 

In some Western Abbruzzo dialects a collective sense is indicated by a suffixless 
form, while the normal plural is marked by the -ora suffix, both plural forms 
being used with the ‘neuter’ article - see also examples (73) and (74) above: 

(89) lu dite ‘the finger’ / le dete ‘(set of) fingers’ / le detere ‘fingers (individually)’ 
(digitus m.); lu vracce ‘the arm’, le vracce ‘pair of arms’, le vraccere ‘arms 
(individually)’ (bracchium n.). 
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(v) Rumanian 

In Rumanian ambigenous nouns (masculine in the singular and feminine 
in the plural) are much more frequent than in Italian; indeed, some 25-30 
per cent of nouns fall into this class, for which the term ‘neuter’ is often 
used. Slavonic influence is sometimes argued. About a third of the relevant 
nouns have -6ra (modern -uri, earlier -ure) plural forms: 

(90) fir (m. sg.) / fire (f. pi.), ‘wire(s)’ (FILUM n.); piept (m. sg.) / piepturi (f. pi.) 
‘chest(s)’ (PECTUS, PECTORA n.); timp (m. sg.) / timpuri (f. pi.) ‘time(s), tense(s)’ / 
timpi (m. pi.) ‘tempos, epochs’ (tempos, tempora n.);frig (m. sg.) ‘cold’, frig- 
uri (f. pi.) ‘shivers’ (frigus, frigora n.); singe (m. sg.) ‘blood’, singiuri (rare f. 
pi.) ‘murders’ (sanguen, sanguinis n. / sanguis, sanguinis m.); lapte (m. sg.) 
‘milk’, Idpturi (rare f. pi.) ‘sorts of milk’ (lac, lacta n.); corp (m. sg.) ‘body’, 
corpuri (f pi.) ‘bodies’ / carpi (m. pi.) ‘bodies (in scientific terminology)’ (cor¬ 
pus, corpora n.); ochi (m. sg. and pi.) ‘eye(s)’ / ochiuri (f. pi.) ‘fried or poached 
eggs, orifices’ (OCULUS, oculi m.). 

The -uri feminine plural appears to be productive, unlike its equivalent in 
standard Italian, having spread to words of non-Romance origin, especial¬ 
ly monosyllabic words: 

(91) drum (-uri) (from Byzantine Greek via Slavonic) ‘road(s)’, pod (-uri) ‘bridge(s)’, 
ceas (-uri) ‘hour(s)’ (from Slavonic). 

In the modem language there is sometimes oscillation between the -uri and the 
-e feminine plural form, with preference for the latter, which is viewed as ‘more 
Romance’. 

(92) nucleurilnuclee ‘nuclei’; aerodromurilaerodroame ‘aerodromes’. 

In some cases the two plural forms are differentiated semantically: 

(93) raporturi ‘relationships’ / rapoarte ‘reports’; resorturi ‘mechanisms’ / resoarte 
‘departments’. 

It seems that Rumanian ambigenous nouns are usually inanimate (or at any 
rate ‘sexless’), whereas the masculine nouns are predominantly animate. The 
feminine class is by far the largest one and contains mostly inanimate nouns; 
it can be considered as the unmarked gender class. Latin neuter nouns nor¬ 
mally have become feminine in Rumanian, except when they end in a conso¬ 
nant, in which case they most usually are ambigenous. The motivation for the 
ambigenous treatment may be morphophonological, as straightforwardly 
‘feminine’ nouns nearly always end in a vowel in the singular. 
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Rumanian gender is not directly related to sex reference: indeed, com¬ 
pared with Spanish or Italian, Rumanian makes little use of noun-form 
class to differentiate the sex of animates. The so-called ‘neuter’ can be 
viewed as a sub-class of the ‘feminine’, or ‘genderless’, class, in which 
‘masculine’ agreement is used with the consonant-final singular forms: 
‘masculine’ adjectival forms, when they differ from ‘feminine’ forms, usu¬ 
ally end in -u or a consonant, whereas the characteristic feminine ending is 
-a. Exploitation of the possibility of use of feminine plural forms to 
match the singular of some masculine vowel-final non-count nouns - like 
singe, lapte, see example (90) above - is attested, but not widespread, and 
considered by some educated speakers to be deviant. 

It is noteworthy that adjectival forms used as abstract nouns occur, with 
a definite article, in the masculine singular or the feminine plural: 

(94) raul (m. sg.) or retele (f. pi.) ‘the evil’ (from the adjective rau (m. sg.), rea (f. 
sg.), rai (m. pi.), rete (f. pi.) ‘bad’). 

This suggests that the ambigenous type is treated as gender-neutral. 

(d) Function of gender 

The functional role of gender class in Romance nouns is difficult to dis¬ 
cern, though some languages - especially Spanish - make extensive use of 
it to differentiate otherwise homophonous nouns. Among animates, gen¬ 
der can be used as an indicator of sex, most often when there is an overt 
-ol-a gender marker: 

(95) Span, hermano/herniana, ‘brother/sister’, Ital. ragazzotragazza ‘boy/girl’. 

Both in Spanish and in Italian, non-standard varieties make greater use of 
sex-gender marking than the standard. 

Gender differentiation can also hint at other types of relationship: 

(96) Span, la ayuda ‘help’ / el ayuda ‘valet’, la trompeta ‘trampet’ / el trompeta ‘trum¬ 
pet player’; Ital. ilbrodo ‘broth’ / la broda ‘dish-water’. 

In some cases the feminine is more affective: Italian in particular appears 
to use the feminine form of doublets in a wider range of metaphorical and 
non-concrete senses. For example: 

(97) tavololtavola ‘table’ 

where the rarer masculine (sometimes characterized as non-Tuscan) is 
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exclusively used for an item of furniture, whereas the feminine can desig¬ 
nate a wider range of things, like ‘list, index, statistical table’ as well as 
‘board, plank’ and so on. 

Even in French, where overt gender markers in the noun itself have 
mostly disappeared, gender can be used to distinguish homonyms (with 
postulated elision of other nouns): 

(98) le [vin de] Champagne ‘champagne (the wine)’ / la Champagne ‘Champagne (the 
region)’; le [bateau d] vapeur ‘steam boat’ / la vapeur ‘steam’, le [paquebot] 
France ‘the [ocean liner] France' / la France ‘France (the country)’. 

Italian also makes use of this device: 

(99) la Roma ‘the battleship (nave f.) Rome' I il Roma ‘the cruiser (incrociatore m.) 
Rome’. 

Size differences 

One rather curious feature of lexical gender-differentiated pairs in 
Romance is that gender sometimes signals a difference in size. More often, 
but not always, the feminine denotes the bigger one of the pair. This phe¬ 
nomenon appears to be absent from Rumanian and rare in French, but it 
is otherwise quite widespread: 

(100) Span, cuchillo ‘knife’ / cuchilla ‘cleaver’ (though in Italian coltello and coltella 
appear to be interchangeable and in some Spanish varieties the feminine is 
actually smaller - in Puerto Rico, Costa Rica and Peru cuchilla ‘penknife’); 
Span, gorro ‘cap without a visor’ / gorra ‘cap with a visor’ (but talego ‘sack’ / 
talega ‘small sack’); Ptg. caldeiro ‘small kettle’ / caldeira ‘cauldron’; Ital. cavic- 
chio ‘wooden pin’ / cavicchia ‘large bolt, peg’, buco ‘hole’ / buca ‘cavity, large 
hole’ (but the more archaic tina ‘vat’ denotes a smaller vessel than t/no); Cat. 
anell ‘ring’ / anella ‘large ring’. 

The semantic motivation of such differentiation has been seen as connected 
with the larger size of the female in some insect species, or with the effect of 
pregnancy on female mammals. Perhaps more plausibly, it is a remnant of a 
spill-over from the frequent phonetic identity of Latin feminine singular nom¬ 
inatives (in -a) with neuter plural nominatives. It has been suggested that the 
equation ‘feminine = larger’ was found in older stages of the languages, over¬ 
laid later by an association of feminine with diminutives and endearment. 

On the whole, however, noun gender’s main effect is to trigger off agree¬ 
ment in determiners and adjectives modifying the noun. 
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(e) Robustness of gender distinctions 

Although the gender agreement system is by no means watertight in most 
Romance languages (and especially in French), there can be little doubt 
that their speakers are conscious of gender differences, and acquire knowl¬ 
edge of the gender of common words simultaneously with their form and 
meaning. In the creoles there is no gender agreement, nor, it appears, con¬ 
sciousness of grammatical gender differenees - though in Guadeloupe 
some remnants of adjectival agreement (as in movehnovez = French mau- 
vais{e)) are found with animate nouns, and Sao Tome uses lexicalized dis¬ 
tinctions like manolmana ‘brother/sister’. However, it is to be noted that in 
some lexical items especially in French creoles (more frequently in Indian 
Ocean than in New World creoles) the original gender-marked definite 
article has been incorporated into the noun: 

(101) French creole latah ‘table’, lisu(chou) ‘cabbage’(cf. also Principe upa (o pao) 

‘bread’, ufogu (o fogo) ‘Are’). 

This suggests that the gender-marked inherited forms had been under¬ 
stood as genderless, unanalysable entities. 


Uncertainty about gender 

Within ‘core’ Romance, speakers sometimes have doubts about the gender 
of nouns: I have already mentioned fluctuations in reflexes of the Latin 
third-declension nouns which had no overt gender marker. In French few 
nouns are overtly marked for gender, though it is possible to make a rea¬ 
sonably good guess at the gender of a noun on the basis of its phonetic 
shape and, in particular, its final phoneme. Moreover nouns hardly ever 
occur in speech without a determiner which is usually, in the singular, 
clearly marked for gender (but see, for Picard, 7.5). 

Where, however, a French word begins with a vowel, clues to its gender 
will be relatively rare in discourse: for the singular, the immediately preced¬ 
ing definite article has the single form F, the demonstrative the single form 
[set] {cet(te)), the possessive the single form moti and so on. Admittedly the 
indefinite article today differentiates tin [d^ masculine from [yn] feminine 
before vowels in the standard, but this is a relatively recent development 
and some non-standard dialects still use the prevocalic [yn] form for both 
genders (e.g. un eleve ‘a pupil’ [onelev], earlier [ynelev], identical with me 
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deve). It is not surprising then that less-educated native speakers mistake 
the gender of such forms as: 

(102) I'eclair (m.) ‘[the] lightning’, I’horloge (f.) ‘the clock’, I’ouvrage (m.) ‘the piece of 

work’. 

In non-standard varieties of French, uncommon words beginning with 
a vowel and ending in a consonant are almost regularly assigned feminine 
gender, as indeed were some such words in Old French. Whereas in older 
stages of the Romance languages, feminine nouns were probably more 
numerous (as they certainly were in French), in most of the modern lan¬ 
guages there are about equal numbers of masculine and feminine nouns. 
This may be due to the, already mentioned, pressure by grammarians to 
retain or restore Latin gender in Romance reflexes, or to a tendency to 
assign more recent non-Romance loanwords to the masculine gender. It is 
likely that changes in balance between forms with different derivational 
suffixes and borrowings from other languages have had some effect on the 
relative frequency of gender assignment. 

Elsewhere the grammatical tradition, and the partial correlation of gen¬ 
der with sex-reference, has helped to retain and reinforce what threatened 
to become a redundant system. Lack of grammatical gender can be count¬ 
ed a defining characteristic of a Romance creole, as distinct from those 
varieties that have retained even the most tenuous link with the Romance 
tradition. In Rumanian, unlike the other Romance languages, there 
appears to have been, over time, a reduction in the number of purely mas¬ 
culine nouns and a corresponding increase in ‘genderless’ (feminine and 
ambigenous) nouns. Here, as we have seen, there appears to be some ten¬ 
dency to distinguish between animates and inanimates on this basis. 

1.4 IN PLACE OF A PARTIAL CONCLUSION 

In this chapter we have seen that the Romance languages have in common 
some morphosyntactic features that are not direetly inherited from Latin. 
There are even more such features that can be seen as descending from 
Latin: the relationship between Latin and Romance will be explored fur¬ 
ther in chapter 3. 

The systems by which person is marked in the verb paradigm reflect the 
Latin system, but deviate from it in detail. Even in French, however, where 
the Latin system has been most eroded, there is sufficient similarity to 
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other more mainstream languages for it to be set in the Romance rather 
than in the creole camp. In the two-gender system, absent from the creoles, 
we find a clearer Romance identity. The so-called ‘neuter’ gender form of 
some languages I claim does not constitute a separate class and does not 
continue the Latin system. 

The most diagnostic criterion for Romanceness, however, has tradition¬ 
ally been shared lexicon, part of which can be traced back to Indo- 
European origins. In the next chapter we shall look at the common 
Romance lexicon and try to draw some conclusions about the light it sheds 
on our question: What is a Romance language? I shall begin by looking at 
the way creoles use Romance lexical material for grammatical functions 
that look very un-Romance, and consider further the question of whether 
we can distinguish a ‘Romance type’ from the more traditional ‘Romance 
family’. 


FURTHER READING 

On typology, Croft (1990), on grammaticalization. Hopper & Traugott (1993). 
For Romance typology, Coseriu (1988), Manoliu-Manea (1985), Raible ed. 
(1989), Guillet & la Fauci eds. (1984). On morphology, Matthews (1991), 
Plank ed.(1991), Zimmer (1992), Drijkoningen (1989), Fisiak (1980). On 
Romance verb morphology. Bybee (1985), Hall (1983) and especially Iliescu 
& Mourin (1991) - a second volume on diachronic factors is forthcoming. On 
parameter setting, Lightfoot (1991), Ouhalla (1991), Bellettti & Rizzi (1994). 
On the pro-drop question and the status of subject clitics, Jaeggli & Safir eds, 
(1989), and on French, Adams (1988), Vance (1988). On gender, Corbett 
(1991). 
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2.1 ROMANCE ‘family’ AND ROMANCE ‘TYPE’ 

I distinguished earlier between the Romance type and the more traditional 
Romance family. The most convincing and obvious mark of similarity 
between the Romance languages - here not excluding the creoles - is the 
degree to which they share lexicon. Phonological differences sometimes 
obscure the similarities from the casual observer, but even the most naive 
speaker can, with a little effort, discern regular relationships. A high propor¬ 
tion of lexico-phonetic correspondences is the criterion used by traditional 
comparative philology to define the language family - members of which, it 
is assumed, have developed from one and the same language spoken by a sin¬ 
gle language community at some time in the past. The most telling evidence is 
to be found in everyday, ‘basic’ vocabulary and in grammatical items, as these 
are more likely to have been handed on from generation to generation, and 
less likely to have been borrowed from foreign tongues. However, the 
Romance creoles, though sharing most of their vocabulary with the rest of 
the Romance languages, seem to be typologically diverse. I shall later in this 
chapter discuss how ‘Romance’ they can be considered to be. But first I shall 
examine the sort of vocabulary that is shared by all Romance languages, and 
discuss the implications for classification. 

2.2 ROMANCE FUNCTIONAL MORPHEMES 

In the Romance context, it is not surprising that the shared ‘basic’ lexicon 
is, very often, similar to that of Latin, or at least of a popular version of 
Latin. We have already seen that Romance verb person markers can be 
linked to Latin forms. Other morphological features will be examined in 
the next chapter, where Romance is systematically compared with Latin. 
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Here we shall concentrate on some ‘function words’. To begin with we 
shall look at forms that have been adopted by the creoles and put to uses 
that are unfamiliar in ‘core’ Romance - the so-called TAM (tense/ 
aspect/mood) markers that distinguish most sharply the creoles from 
other Romance languages. 

(a) Creole tense/aspect/modal markers 

Even more than the absence of verb person endings the morphosyntactic 
feature that most clearly excludes the Romance creoles from the ‘Romance 
type’ is the fact that they have no synthetic verb tenses or moods, but, 
instead, prefix tense, aspect or modality markers, in various combinations, 
to invariable lexical verb forms. What is striking in this respect is that the 
TAM forms used in the creoles are nearly always of Romance origin. The 
use to which they are put looks totally unlike Romance, but they may 
sometimes reflect Romance periphrastic contractions (see 3.3, 4.5). The 
purport of the verb prefixes varies from one creole to another, but the 
overall pattern is so similar, whatever the lexifier language, that many theo¬ 
rists believe that it must be taken over from another language type - 
Western African languages, for instance. 

In Indian Ocean (Seychelles and Mauritius) French creoles, for instance, 
the preverbal markers have the following forms (approximate functions are 
indicated in capitals): 

TAM markers in Indian Ocean creoles, with French equivalents 
PAST, title (< etait) ‘was’; 

PROGRESSIVE'. (a)pf (< apris', cf. the seventeenth-century and Canadian French periphrastic 
progressive je suis apresfaire ‘I am (after) doing’); 

PERFECTIVE', fi (n) (< finir ‘finish’); 

RECENT PAST, fek (< ne fait que = ‘(he) has only just done’) or soti (< sortir ‘go out’; cf il vient 
de... ‘he comes from...,’ i.e. ‘he has just...’); 

NEGATIVE', pa (< pas)', 

future: a (< va; cf il va faire ‘he is going to do’, pu{< pour or pouvoirl Cf archaic il est pour 
faire ‘he is about to do’ and il pent faire ‘he can do’). 

The two FUTURE markers seem to have slightly different modal meanings: a is 
variously described as ‘indefinite’, or ‘expressing certitude’, while pu is ‘defi¬ 
nite’, ‘immediate’ or ‘uncertain’. Ulelvie (< vouloir) is also used apparently as a 
FUTURE marker: in nonstandard French il veut faire ‘he can do’ can be used in 
the same way as il va faire ‘he’s going to do’. 
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French creoles in America and the Caribbean use similar markers: besides 
ape {ap, apr, pe), te, {fi)n and {v)a we find s(r)e (< serait) as a conditional 
marker. Forms for which no obvious French etymon can be found include 
ka or ko as a habitual marker (of African origin, or < capable!) and also, 
in some parts, a future marker ke, which is usually seen as a West African 
particle. In the Caribbean islands St Barth and St Thomas, the patois, like 
some Louisiana cajun varieties, use a construction with ki {qui) which may, 
however, be at the root of ke: 

(1) Jesuisquival Ma quiva lefaire 
I- am who goes / me-go who goes it to-do 

‘I am going’ / ‘I shall do it’ 

Portuguese creoles also use markers in similar ways but with variable 
forms. In the erioulo of Guinea-Bissau and Cape Verde, for instance, na 
(Ptg. ‘in the f.’) is a progressive verb marker, as well as indicating location 
when used with an inherited noun: 

(2) Ina kuuri ‘He is running ’! Ista na kasa ‘He is in the house’. 

Here, as in some other creoles, there is no clear distinction between noun 
and verb, and na seems to be used to denote some sort of continuity in 
space or time, as in some African languages. Cape Verde varieties also use 
a more Portuguese-looking continuous verb form: 

(3) Sta entrano - Ptg. Estd entrando ‘They are coming in’; 
as well as an apparently periphrastic future-. 

(4) N sta faze ‘I am to do’ 

and a ja (‘already’) completive in which the TAM marker uncharacteristi¬ 
cally precedes the subject pronoun and sometimes is repeated after the verb: 

(5) Dja n daba nhapalabra ‘I have given my word’ / Dja’l come dja ‘He has eaten 
already’. 

In Other African Portuguese creoles the ja particle occurs clause-finally: 

(6) Guinea-Bissau Ibaj dja ‘He has gone’; Sao Toms Nfla za ‘I have spoken’. 

In Kristang (< Ptg. cristdo ‘Christian’) spoken by some 1,500 people in 
Malacca, Western Malaysia.yd acts, on the other hand, like other preverbal 
markers; 


(7) 


Yo ja chegd ‘I arrived’. 



74 


The similarities 


The FUTURE marker in crioulo is ba (< vai ‘go’) or al. The marker ta 
(< esta) is probably basically a non-past, non-continuous marker, used as 
a HABITUAL or ITERATIVE. In the Portuguese-based Gulf of Guinea creoles, 
on the other hand, an apparently non-Portuguese ka is the habitual 
marker (as in the Asian Portuguese). Na (< nao) is here used for the nega¬ 
tive marker, which in some other Portuguese-based creoles has the form 
ka (< nimca). In Sao Tome, however, we find for the negative, besides na, 
a clause-final fa, perhaps of African origin. A more familiar looking ska 
(< sa ‘be’ + ka) here marks the progressive, te (< ter ‘have’) the distant 
FUTURE, ta (< esta) the past, and bi (< vir ‘come’) some degree of compul¬ 
sion. In Sri Lanka the progressive form is te. 

In Curagao papiamento imperfective ta and habitual sa (? < saber 
‘know’) markers are found; ke (? < querer ‘want’) seems to be a modal, 
while irrealis lo (< logo ‘then’) can be used as a euture marker (as it is 
in Asian creoles, but there it usually precedes the subject pronoun, act¬ 
ing like a sentence adverbial). The past tabata (older taba, colloquial 
ta’a) seems to be derived from the Hispanic imperfect estaba with agglu¬ 
tination of ta ‘to be’ (cf. tabatin ‘had’ with agglutination of tin ‘have, 
hold’). 

Philippine Spanish varieties also make use of particles of apparently 
Romance origin, prefixed to invariable verb forms: ta non-punctual 
{habitual and progressive), ya ‘now, already’ perfective (also post- 
posed to the verb) and de or aj (? < de ‘of’, hay ‘there is’) future. 

One notable feature of many Portuguese creoles, and of Colombian 
Spanish-based palenquero, is their use of a ba (or va) particle to indicate 
past tense. This does not seem to be related to the Catalan va preterite (see 
4.6) but may be derived from the imperfect -aba inflection of Portuguese 
(see 3.3). It is sometimes used in a different way from verb prefixes: 

(8) Guinea-Bissau I bin ba ‘He cwie'', Iprofesor ba ‘He was a professor’; 

ba ‘He knew you’; Cape Verde El ta ba chiga I El ta chiga ba ‘He used to arrive’; 
Sao Tome E ta va nda ‘He went’, E ta va ka nda ‘He was going’. 

Some think the ba particle is of African origin, but it is possible that here 
we have an agglutination of the Portuguese verb inflection rather than the 
development of a suffixed particle. 

Interestingly enough, in other rather conservative creoles - in Louisiana 
and the Indian Ocean island of Reunion - remnants of an imperfect 
inflection have survived: 
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(9) Louisiana (St Martin) mo kup ‘I cut’ (present) / mo kupe ‘I cut’ (past) = moi 
coupe I moi coupais\ Reunion mi i maz I mi i maze ‘I eat / ate’. 

Some linguists see in such forms the effects of a decreolization process that 
has re-introduced flectional endings into languages that normally use agglu¬ 
tinated prefixes. Whether the imperfect flections are survivals or re-introduc- 
tions, their existence helps to blur the distinction between Romance 
‘dialects’ and creoles (see 5.7). That the imperfect flection is the only one to 
be found among creoles may be of particular significance, for, as we shall 
see, the only two Latin tense-aspect forms that survive intact in every 
Romance ‘dialect’ are the (unmarked) ‘present’ and the (past) ‘imperfect’. 

It can be maintained that creole verb prefixes are so alien to Romance (and 
to other Indo-European languages) that their origin must be sought elsewhere 
- especially as similar features are found in non-Romance creoles. Yet it is true 
that all Romance dialects have developed compound and periphrastic verb 
forms in which an auxiliary precedes a non-finite form of the lexical verb. 
Some Romance languages make greater use of such forms than others. 

One notable feature is that, usually, the non-finite lexical verb form con¬ 
tributes to the aspectual or modal meaning conveyed; a gerund(ive) for 
progressive aspect, a past participle for perfective or resultative, and an 
infinitive for modalities. In creoles the verb form is nearly always clearly 
derived from an original infinitive. Portuguese and Spanish creoles, how¬ 
ever, often retain Romance-like forms (as we have seen for Cape Verde). 
Papiamento, for instance, distinguishes mata ‘kill’ from mata (< matado) 
‘killed’, as well as, like palenquero, retaining remnants of the progressive 
type estar + present participle: 

(10) Papiamento E tabatapapyando ‘He was talking’; Palenquero / ta kanta(ndo) ‘I 
am singing’. 

It is noteworthy that in French, periphrastic formations with the 
gerund(ive), or present participle, have fallen out of use, while those which 
use the infinitive have been extended (see 4.5). Moreover, the past participle 
is often indistinguishable from the infinitive, so that, for instance, a form 
like II a chante [ilajate] ‘He sang, he has sung’, could readily be reanalysed 
as a structure similar to II va chanter [ivajate] ‘He is-going to-sing’. 

French, especially in its ‘popular’ varieties, is often regarded as the most 
‘advanced’ of the Romance languages, in the sense that it is nearer to creoles 
than the other languages. We shall later (5.7) compare and contrast extreme 
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versions of French patois that exist alongside French creoles (in Louisiana and 
the Caribbean), in the light of the hypothesis that creoles are new creations 
fashioned by a language bioprogramme. 

There is little to suggest that in the Romance creoles the TAM markers are, 
or were at one time, limited to Bickerton’s bioprogrammatic functions of 
ANTERIOR (? = my past), irreal and non-punctual. Most Romance cre¬ 
oles distinguish future, habitual and progressive, but in these latter 
two cases some creoles use a bare verb form for one or the other. In nearly 
all cases the TAM markers can be linked etymologically, and to some 
degree functionally, to Romance periphrastic formations with similar 
functions. One difference, however, which does seem to hive off the creoles 
from other Romance languages is the way in which negation often appears 
to be treated as a modal and so part of their TAM system (for discussion 
of Romance negation see 7.4). 

Certainly the creole TAM systems are very different from those of 
Romance, which still relies on morphological inflections rather than inde¬ 
pendent morphemic markers to signal tense, aspect and mood. Where 
compound or periphrastic verb forms have been developed in Romance, 
they have tended to be grammaticalized to become virtually new inflec¬ 
tions. On the other hand, some of the so-called creoles do have verb fea¬ 
tures that resemble those of ‘core’ Romance. For the question of ‘decre- 
olization’, alleged dilution of creole features due to close contact with 
non-creole Romance languages, see 5.8. 

(b) Personal and possessive pronouns 

The ‘discourse’ pronouns (persons 1, 2,4, 5) in the Romance languages are 
strikingly similar to those of Latin (for persons 3 and 6 pronouns see 7.5). 
It is noteworthy that the Latin pattern holds for most Indo-European lan¬ 
guages, so we may talk here of long-term retention of Indo-European 
forms, rather than specifically Latin-to-Romance conservatism. 

It is hardly necessary here to reproduce the tables of pronominal forms that are 
to be found in all the manuals. Some comments are, however, called for. 

No Romance language perpetuates the Latin genitive forms mei, tui, 
NOSTRi, VESTRi, but most use adjectival possessives of the meus, tuus, 
NOSTER, VESTER type. Many of the modern forms, however, show irregular 
phonological development, often due presumably to paradigmatic levelling. 
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Thus a proto-Romance *vostru is postulated as the predecessor of 
most person 5 forms, and some person 2 forms have been remodelled on 
person 1: 

(11) *teum: French tien, Ptg. teu (but feminine tm). Cat. teu. Rum. tau / tuum: 

Ital. tuo, French, Cat. ton. Span, tu, tuyo (influenced by cuyo < cuius ?). 

In Surselvan and Engadine Rhaeto-Romance and in Portuguese persons 4 
and 5 pronouns appear to derive from unattested (archaic or dialectal?) 
forms *NOSSU, *vossu. 

In those parts of Southern Italy where final vowels are reduced to schwa, the 
possessive often has a form identical with that of the object pronoun and is 
enclitic to the noun: 

(12) Abruzzo [lu'libbrama] ‘[the] my book’. 

(For the use of the definite article with possessives see 6.5.) 

Enclitic possessive forms were also used, without a definite article, in liter¬ 
ary Tuscan until the fourteenth century, when they were condemned as 
‘low’, but they were always distinct from object pronouns: 

(13) mdgliata ‘your wife’, signorso ‘his lord’. 

Creoles often use invariable postposed pronouns as possessives, with or with¬ 
out a preposition: 

(14) Guadeloupe timu-a-u ‘little-person-to-you’ i.e. ‘your child’; Haiti (Port-au- 
Prince) pitit mwe, (Northern) pita-m ‘my child’; Sao Tome manu-m ‘my broth¬ 
er’, ke no ‘our house’. 

Some creole varieties, however, use reflexes of the possessive (sometimes, in 
Portuguese creoles, generalizing the feminine form): 

(15) Mauritius mo madam ‘my wife’, nu takas ‘our house’; Cape Verde mye irmo ‘my 
brother’, Asian Portuguese minha pai ‘my father’. 

Some ‘core’ Romance varieties also prefer a combination of preposition a(d) 
or DE with a pronoun to specifically possessive forms. 

The distinction between nominative EGO, tu and accusative me, te is 
maintained by the ‘core’ languages (though phonetic erosion has often 
rendered tu and te indistinguishable). However, some dialects, in particu¬ 
lar in France and north Italy, use, as disjunctive pronouns in all sentences 
roles, stressed reflexes of the old accusative forms: 

(16) French Moi, je te donne fa, d toi ‘Me, I give that to you’. 
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In most French creoles an originally stressed-type pronoun, mwSmo etc., is 
found in every sentence role and there is no cliticized nominative. African 
Portuguese creole n/[ji] unstressed person 1 pronoun may be a phonetic devel¬ 
opment of mi, or it may be of Bantu origin. Other Portuguese creoles prefer 
reflexes of eu. 

For persons 4 and 5, whereas Latin did not distinguish nominative from 
accusative, some Romance languages tend to add an element ALTERi /-OS 
‘others’ to the nominative, or at least to the disjunctive, form. In post- 
medieval Spanish and Catalan, the element is obligatory: 

(17) disjunctive nosolrosl-as, nosaltres and vosotrosl-as, vosaltres, alongside clitic 
accusative-dative nos, os and vos, us. 

In standard Italian noialth and standard French nous autres are optionally used 
for noi and nous. In some French creoles zot (from vous-autres) is the only form 
for person 5 (and, sometimes, for person 6). In Friulian no(altm), vo{altris), the 
altris appears to be obligatory for the postprepositional pronoun. 

In Italian the accusative-dative clitics of adverbial origin, ci and vi, along¬ 
side disjunctive noi, voi, began to replace no and vo, from the thirteenth 
century. 

Generalization of vos, originally used as a polite substitute for xu (see 
1.2), is found in several areas, including Haitian and most African 
Portuguese creoles (except in Principe). In American Spanish, vos is often 
used as the familiar nominative form {voseo - see 1.2), though te may be 
retained as the object clitic pronoun. Some Portuguese creoles, however, 
have generalized other originally honorific forms: 

(18) Cape Verde (Sotavento) (flosinozi) (from os senores ), (Barlavento) hoses (from 
wee + s), Sao Tome (i)nanse, Anobom namesezi (perhaps from na, a pluralizer 
of African origin + merces ‘grace’). 

In Colombian palenquero, ute (Spanish usted) has similarly been generalized. 

In Argentinian popular speech the definite-article form los is used for the clitic 
reflexive nos'. 

(19) Los vamos ‘Let’s go’ (more vulgarly Se vamos, with person 3 reflexive general¬ 
ized to all persons). 

Most languages have no reflex of the Latin accusative/dative distinction 
(me/mihi, te/tibi, nos/nobis, vos/vobis). One apparent puzzle is the rela¬ 
tionship of Italian (tonic) me, te / (atonic) mi, ti, to Spanish (tonic) mi, ti I 
(atonic) tne, te. Are these a result of different phonological rules, or are 
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they haphazardly derived from me, te / mihi, tibi? Even more puzzling are 
such Spanish forms as migo (? < mihi + cum) used after con (< cum). 

Sard, Southern Italian dialects, Rumanian and Galician retain some reflexes of 
both accusative and dative forms, but do not use them in their Latin functions 
(see 3.3). 

Whereas in Latin personal pronouns were full nominals, in the Romance lan¬ 
guages clitic (conjunctive) forms have evolved, derived from unaccented 
forms, alongside originally accented disjunctive (autonomous or post-prepo¬ 
sitional) forms. In the creoles, as we have seen, there is no such systematic dis¬ 
tinction, nor is there in modem Surselvan Rhaeto-Romance, where use of 
disjunctive forms, instead of earlier clitics, is thought to be a recent innova¬ 
tion, due to influence from German. In all other languages, object pronouns 
are cliticized (see 4.4); in French and North Italian varieties, clitic subject pro¬ 
nouns have become, as we have seen (1.2), virtually bound person markers. 

(c) Prepositions 

More than a third of the prepositions in common use in Romance are rec¬ 
ognizably related to Latin ones, though often they do not mean exactly the 
same. The following are some of the virtually pan-Romance items, preced¬ 
ed by approximate English translations: 

Some pan-Romance prepositions 

‘above, on’: super, supra,: French sur, Ital. sopra. Span, sobre. The Latin adverb sursum 
‘up(wards), over’ also has reflexes in most languages: French (au des) sus (de), 
Rum. sus ‘above’ / spre < super ‘towards’ Ipe< per ‘on’, Old Span, suso, Ital. 
su. In some French creoles lao (Id-haut ‘(high) up there') replaces sur. 

‘against’: CONTRA: French centre, Old Ital., Sard, Span., Ptg., Occ., Cat. contra, modern Ital. 

contro. Rum. cdtrd, Engadine hunter, Friulian kuintri. 

‘at, to’: ad: a everywhere - in Rumanian it is used only in adverbial phrases, la (< illac-ad) 
‘there’ replacing ad in its original sense: a similar form is used in Abruzzo and 
Friuli. Some French creoles use [kot] (< (d) cote (de) ‘by the side of’, for 
French d. 

‘before, in front of’: ante, including various combinations with other prepositions 

(ab/de/in/de, ab/de, in/ad, ad/de). Usually temporal use is distinguished 
from spatial: French avant, devant, Ital. avanti, davanti, innanzi, dinanzi. Span. 
antes, delante (older denante), Ptg. antes, adiante. Rum. inaintea Idinainte. 
‘between’: inter: spelt enire nearly everywhere; Sard inter. Rum. intre. 

‘from, of’: de: de nearly everywhere. Ital. di; composite forms include Ital., R-R (Engadine) 
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da<DE + AB, Rum. din < de + in and Span, desde < de + ex + de. 

It has been claimed that ‘true’ creoles lack a range of prepositions, using 
instead serial-verb constructions; 

(20) Guyana Li pole sa bay mo (< French Lui porter (a haiUe (‘give’) moi) ‘He 
brought it for me’; Haiti Mwe kouri ale lakay li (< French Moi courir alter la 
case lui) ‘I run to his house’. 

Serial verbs are found in a number of Romance creoles, though not in Indian 
Ocean French creoles. However even those varieties that use verb serialization 
still have a range of prepositions that match those of their lexifier language: 
thus Haiti ak ‘with’, pu ‘for’ from French avec, pour, compared with Principe 
ki,pa from Portuguese con, para. 

2.3 WORD FORMATION 

In the modern Romance literary languages between 60 per cent and 80 per 
cent of any text will consist of words apparently used uninterruptedly from 
Roman times and clearly related to Latin. Among loanwords, a large pro¬ 
portion are borrowed directly from Latin or the other Romance languages, 
so that the ‘Romance’ character of the vocabulary is overwhelmingly obvi¬ 
ous. Moreover, about 10 per cent of the words in any text will be created 
from native sources: it is not always certain whether such words were already 
in use during the Imperial period or whether they were later creations. 

The resources for creating new words are similar in all the Romance lan¬ 
guages. Change of function class - from noun to adjective and vice versa 
and from verb infinitive to noun, or from participle to adjective etc. - is 
particularly frequent. 

Reduplication of adjectives or of verbs for intensification is also wide¬ 
spread in the colloquial varieties: 

(21) Ital. pian piano ‘very quiet’; Span. Entender, entiendo ‘I certainly understand’. 

Creoles make even wider use of this device, some believe under African 
influence: 

(22) Mauritius Li ti marse marse ‘He trudged on and on’; Sao Tome O pegapega 
pishi ‘He catches fish often’. 

However, it is to be noted that Sard varieties also make extensive use of 
similar reduplication: 


(23) 


Nuoro Mori mori ‘He dies, he dies’, i.e. ‘He is about to die’. Time time ‘He 
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fears, he fears’, i.e. ‘He is terrified’. 


(a) Compound words 

Compounding seems to be less common in Romance than in Germanic 
languages, though it appears to be on the increase, and has always been a 
possibility: 

(24) French gratte-ciel ‘scratch-sky’, i.e. ‘sky-scraper’, rouge-gorge ‘red-throat’, i.e. 

‘robin’; Ital. dopopranzo ‘after-lunch’, i.e. ‘afternoon’, nettadenti ‘clean-teeth’, 
i.e. ‘toothpick’; Rum./ioarea-soarelui ‘flower of the sun’, i.e. ‘sunflower’, papd- 
lapte ‘pope-milk’, i.e. ‘milksop’; Span, lavamanos ‘wash-hands’, i.e. ‘wash- 
basin’, aguardiente ‘water-burning’, i.e. ‘brandy’. 


Adverbs in -mente 

The most widespread (though not pan-Romance) procedure for formation 
of adverbs also originated as compounding, though in most of the lan¬ 
guages it is now suffixal. Already in Classical Latin, instead of an adver¬ 
bial form of -ITER, the ablative of the word for ‘mind’ (mens, f., ablative 
mente), most often following an agreeing adjective, could be used to indi¬ 
cate the manner in which an action is performed: as in placida mente 
‘with peaceful mind’. Mente was to become virtually an adverbializer, 
which could be suffixed to nearly all qualifying adjectives: 

(25) French doucement ‘softly’, Ital. raramente ‘rarely’, Ital. dial. (Bologna) breve- 
meint ‘briefly’, Occ., Cat., bellamen(l) ‘beautifully’. Span, distintamente ‘dis¬ 
tinctly’, Ptg. cruamente ‘cruelly’, R-R (Engadine) puramaing ‘purely’, 
(Surselvan) finalmein ‘finally’, Dalmatian (Veglia) fuartemiant ‘strongly’, Sard 
finalmenti(s) ‘finally’. 

Constraints on use of the mente suffix have led some linguists to postulate 
that it still retains some autonomy. For instance, rather like the English 
suffix -ly, it is less readily used with adjectives referring to strictly physical 
properties, suggesting that it still has a ‘mental’ tinge: 

(26) Ital. *giallamente ‘?yellowly’, *bruttamente ‘‘uglily’, *lungamente (with refer¬ 
ence to space) ‘‘longly’. 

In early Romance the two parts of the syntagma probably retained 
greater autonomy, as other words could be interposed, and, as in 
sequences of two adverbial phrases, the mente could be omitted in one - 
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most frequently the first (as in modern standard Spanish and Portuguese) 
but also the second (as in Old Occitan and Catalan): 

(27) Ptg. clara e distintamente ‘clearly and distinctly’. Span, lenta y cuidadosamente 
‘slowly and carefully’. Cat. bellament i dolya ‘nicely and gently’. 

In modern Spanish and Portuguese, adverbial forms in -mente also, unusu¬ 
ally, have two stresses, suggesting that they are treated as two words. 

In Old French there seem to be no unambiguous examples of omission 
of -merit in sequences, and it is probable that it was already an adverbial 
suffix, homophonous with the -mentum suffix (see below). French appears 
to have retained no other inherited trace of the Latin word mens, except in 
the now extinct word for ‘remember’, mentevoir. In modern French, too, 
the adjectival element is not always distinctively marked as feminine (see 
evidemment [evidama] with evidente f [evidat]). 

Apart from modern Occitan and Dalmatian, the other Romance languages do 
have reflexes of the word MENTE, though some of them may be Latinisms: 

(28) Ital., Sard (Logudorian), Span., Ptg. mente. Rum. minte. Dolomite R-R (Gardena) 
ment. Old Span, miente survived till the Golden Age and till this day in expres¬ 
sions like Iraer a las mientes ‘to bring to mind’. Old Occ. men (f) is not now 
current; Cat. ment (m.) is not common and is possibly modern. 

In some languages the adverbial suffix was differentiated from the noun by 
the addition of a so-called ‘adverbial -s', allegedly on the analogy of 
adverbials like minus: 

(29) Old Occ. fortamens ‘strongly’. Old Ptg. claramentes ‘clearly’. 

Elsewhere an -r- was incorporated into the mente element: 

(30) Old Span, solamientrelsolamiente. Ital. dial. (’Veneto) solamentre ‘only’; Old 
Lombard malamentre ‘badly’; R-R (Friuli) stupidamentri ‘stupidly’, 
(Dolomites) fermamenter ‘firmly’; Rum. altminterilaimintre ‘otherwise’. 

The most likely source of the -r- is the Latin adverbial marker -iter, which 
may have become attached to mente C^menter?). Spanish mientre, ousted in 
the fourteenth century by ‘learned’ mente, is thought by some to show contam¬ 
ination with {do)mientre < (dum)-interim ‘while’. 

Whatever their origin, it does seem that the additions differentiated the 
adverbial ending from the mente noun forms: it is possible that learned 
influence led to the restoration of more transparent forms in Spanish and 
Portuguese at the beginning of the modern era. It has also been main- 
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tained that Italian -rnente (unattested before the mid-eleventh century) was 
a borrowing from either French or Latin rather than a native development, 
and so evidence about its status as a bound suffix or free form is unclear. 

In the extreme south of Italy and in Rumania the mente construction 
does not appear to have caught on, though some examples, presumably 
loanwords, are found (e.g. Rum. literalmente). The most usual device con¬ 
tinues the use of the neuter adjectival form as an adverb, which was fre¬ 
quent in later Latin: 

(31) Rum. cinta frumos ‘to sing beautiful(ly)’, Calabria mi diinanu sulu ‘They only 
[alone m.] give me’ = Ital. mi offrono solamente. 

In Latin America, especially, similar usage is frequent in popular speech; 
indeed, with intransitive verbs it is common in all Spanish colloquial 
speech. 

In some Southern Italian dialects the adjectival form used in this way can agree 
in gender with the subject of the verb: 

(32) Conidoni (Calabria) Kusisti bonulbona, San Chirico Raparo (Lucania) Ai kusii- 
tu bwenuibnena ‘You have cooked well [good m./f]’ = Ital. Hai cucito bene 

A not dissimilar usage is normal in Spanish with some verbs: 

(33) Hablas ligerola ‘You talk quickly (quick m. /f.)’. 

It has been maintained that the mente construction was unknown in 
‘Eastern’ Latin though examples from Dalmatian (25 above) and 
Rumanian (30 above) suggest that this cannot be wholly true. The absence 
of the MENTE construction in Campania, Abruzzo, Apulia, Lucania and 
Calabria has been ascribed to the influence of Greek as used in Magna 
Graecia (the part of southern Italy that was within the Eastern Empire), in 
which adverbs were frequently phonetically identical to adjectives. 

Even in the languages where -mente is the normal adverbial suffix, short and 
much-used adjectives double as adverbs, in their masculine singular form: 

(34) French faux, fort, Ital. alto, piano'. Span, claro, temprano. 

We should note that in Rumanian the numerous adjectives in -esc (< iscu, or of 
Slavonic origin?) have a special adverbial form in -e§te: 

(35) prote^te ‘foolishly’ / protesc ‘foolish’. 
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(b) Derivation 

Apart from this Rumanian case, derivational procedures are strikingly 
similar in all the Romance languages: a number of suffixes and prefixes are 
pan-Romance and, though not always fully productive in modern times, 
they are usually easily recognizable as bound morphemes. Some of them, 
admittedly, are virtually pan-European - borrowed from Latin or Greek - 
but Romance languages probably make greater use of them than do other 
languages. Standard French has, since the seventeenth century, tended to 
restrict derivation, except with ‘learned’ affixes, though Canadian French 
continues the earlier habit of proliferating near-synonymous derivatives. 

All the Romance languages, except modem French, make great productive 
use of expressive suffixes, especially diminutives, augmentatives and depre- 
ciatives: 

(36) Span, hermanito ‘little brother’, Ital. omaccione ‘nasty big man’, Ptg. livrinho. 
Rum. carticcicd‘little book’. Cat. cadirota ‘ugly old chair’, Occ. galinassa ‘great 
big hen’, R-R (Surselvan) vialscha ‘bad road’, Sard (Nuoro) minoreddeddeddu 
‘very, very little’ (with three repetitions of the diminutive suffix). 

In the creoles, on the other hand, bound derivational affixes play little or no 
role in lexical creation, though crioulo (Guinea-Bissau and Senegal) has a pro¬ 
ductive causative -iit- affix (perhaps of Madinka origin, or generalized from 
verbs like levantar ‘to raise’, adiantar ‘to put forward’): 

(37) tjordnta ‘to make weep’ / tjora ‘to weep’ = Ptg. chorar. 

(i) Prefixes 

Some Romance prefixes were originally detachable prepositions (de, ab, 
AD, CONTRA, CUM, EX, IN, SUB) that have become agglutinated to free 
forms, especially to verbs. It is thought that spoken Latin must have had a 
predilection for extended forms - with intensive or expressive affixes - 
which have often survived into Romance at the expense of the simple 
forms and are no longer recognized as derivatives: 

(38) French acheter ‘buy’ < ad-captare; Ital. cominciare, French commencer. Cat. 
comensar. Span, comenzar, Ptg. comefar < cum-initiare; French souffler, Ital. 
soffiare, Ptg. soprar. Rum. sufld ‘blow’ < sub-flare. 

A prefix that is still analysable as such, if not productive in all lan¬ 
guages, is the iterative or intensive bound form RE-: 
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(39) Ital. riandare, ‘go again, go back’. Span, realzar, Ptg. realfar ‘elevate, enhance, 
raise up’ (intensive), French ravoir ‘regain, have again’, Rum. revedea, ‘see 
again’. 

This prefix is also productive in the creole of Mauritius: 

(40) Li n re-vini ‘He came again’. Mo ti re-ekrir zot ‘I wrote to them again’. 

The French creole of St Lucia seems to use a de- prefix in a similar way as 
an intensive or iterative: 

(41) depale ‘rant’ / pale ‘speak’ = French purler 

Dis- is widespread with negative force, like English un- or dis-\ 

(42) Ital. discaricare. Span, descargar ‘unload’, French defaire, Rum. desface ‘undo’. 

(ii) Suffixes 

Sufiixation is the most common means of derivation in Romance, and most 
of the pan-Romance suffixes are reflexes of Latin ones. Not infrequently a 
Latinate borrowing has ousted a cognate native suffix, as in French, where 
-aison (e.g. pendaison ‘hanging’) from -atione(m) is no longer productive, 
while more modern -ation (e.g. cultivation) remains so. Less clearly one of 
borrowing as against popular development is the contrast between [a'tsjone] 
and [a’d 3 one] in Italian and between [a'Ojon] and [a'Gon] in Spanish: 

(43) Ital. sahazione ‘salvation’ / venagione ‘game, venison’; Span, ligacion, ‘tying, 
binding’//igacdn ‘tie, bond’. 

The -tione(m) suffix is very productive: 

(44) Rum. putreziciune ‘dissolution’; Span, embarcacion ‘embarkation, ship’; Ptg. 
liquidapao ‘liquidation’; Sard (Logudorian) [kan'done] ‘singing, song’. 

Other still more or less recognizable pan-Romance abstract noun suffixes 
include: 

Pan-Romance abstract noun suffixes 

-MENTU: French logement ‘lodging’, Ital. stiramento ‘pulling’. Span, acercamiento ‘approach’. 
Rum. vesmmt ‘clothing’, Sard (Campidanian) iuvamento ‘help’, Curasao (papi- 
amento) perdementu ‘loss’; 

-TATE: French cherte ‘dearness’, actualite ‘current happening’, Ital. sanitd ‘health’. Span, leal- 
‘loyalty’, ‘reality’, Cat.falsedat ‘falseness’, Rmn. pufinate, Sard 

(Nuoran) minoridade ‘smallness’; 
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-itia: Vr&ach franchise ‘[rdnkness’, finesse "fineness’, Span, limpieza ‘cleanness’, Ital. bellezza. 
Rum. dulcea^, ‘sweetness’, Cat. aridesa ‘dryness’, Ptg. magreza ‘thinness’, 

Sard (Logudbrian) bonissja "goodness’; 

-ia: French courtoisie ‘courtliness’, \lst\.pazzia ‘madness’. Span, alegria ‘happiness’. Rum. 
anfie ‘wealth’, Sard (Logudorian) nobilia ‘nobility’. 

Agentive or instrumental nouns in -ore(m) and -ariu(m) are still some¬ 
times productive: 

(45) French chercheur ‘researcher’,ye;7H;er ‘farmer’; Span, hablador ‘talker’, cartero 
‘postman’; Ital. cacciatore ‘hunter’, calzolaio ‘cobbler’; Cat. bevedor ‘drinker’, 
mariner ‘seaman’; Rum. cmtdtor ‘singer’ (not directly from Latin cantator, 
but from cint + suffix -ator), plugar ‘ploughman’; R-R (Surselvan) mutgider 
‘dairy-man. milker’; Sard (Logudorian) piskadore ‘fisherman’, krabardzu ‘goat¬ 
herd’. 

The originally diminutive suffix -ellu has reflexes everywhere (though 
sometimes it is no longer a separable morpheme in modern languages, as 
in French couteau, Ital. coltello. Span, cuchillo ‘knife’< cultellus ‘little 
knife’ from culter ‘knife’). Romance derivatives include: 

(46) French ruelle ‘little street, alley’, Ital. paesello ‘small village’. Span, bolsillo 
‘small bag, purse’; Rum. bdrhdtel ‘little man’, Sard kateddu ‘little dog, puppy’. 

Other adjectival suffixes that have, on the whole, remained productive, or 
at least separable, include: 

Adjectival suffixes 

-ALE(M): French /n'verna/‘wintry’, morie/‘fatal, mortal’ (with a distinction between learned 
and popular development), Ital. invernale, mortale, Sard (Logudorian) istadi- 
ale ‘summery’, Span, gubernamental ‘governmental’. Rum. pastoral ‘rustic’ 
(possibly a French borrowing); 

-bile(m): French bldmable ‘reprehensible’, Ital. agevole ‘easy, manageable’, Span, razonable 
‘reasonable’, R-R (Surselvan) kusteival‘cosi\y\ Sard (Logudorian) addegibile 
‘decent’. Words like Rum. remarcabil are borrowings (cf. vandabil ‘saleable’ 
from French vendable) and it seems that this suffix has never been productive in 
Rumanian. 

-ivu(m): French penj;/‘thoughtfur, Ital. tardivo. Span, tardio ‘tardy, backward, dilatory’. 

Rum. dulciu ‘sweetish’, R-R (Engadine) sulaglif‘sunny’, Sard bagantiu ‘fallow 
[vacant] (of field)’; 

-osu(m): French ennuyeux ‘boring’, lisd.pauroso ‘fearful’, Span, arenoso ‘sandy’. Cat. 

coratjos’ ‘courageous’. Rum. barbos ‘bearded’, Sard (Logudorian) fodzosu 
‘leafy’. 
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-ense(m) is also fairly productive, especially to form adjectives from place- 
names: 

(47) French bolognais, namiirois; Ital. milanese; Span, aragones. Rum. englez, Sard 
kagliaresu. 

2.4 SHARED LEXICON 

Anyone familiar with one Romance language will have little difficulty in 
recognizing in the vocabulary of another at least half of the common 
words: Rumanian will present more vocabulary difficulties than any other. 
In the case of some pairs of languages - Spanish and Portuguese, for 
instance, or French and Occitan - some 90 per cent of the ‘basic’ vocabu¬ 
lary will look familiar. Italian is again the ‘archetype’ - the most ‘central’ 
of the languages, which has some 80 per cent of its ‘basic’ vocabulary in com¬ 
mon with most languages. 

It is not surprising, then, that among Romance speakers there is some 
degree of mutual intelligibility or that, at any rate, they have comparatively 
little difficulty in learning each other’s language. Barriers are raised more 
by differences in phonetic shape than by lexical differences. In the next two 
sections I shall exemplify first from the numerals in Romance and then 
from other ‘basic vocabulary’. 

(a) Numerals 

The Indo-European names of numbers have on the whole persisted into 
Romance, but there are some noteworthy differences. 

(i) Numerals 1-10 

In all the Romance languages the cardinal numbers 1-10 are clearly related 
to their Latin counterparts. The indefinite article is often identical with the 
numeral 1, and agrees with the noun to which it refers. In Dolomite 
Rhaeto-Romance varieties, however, the reduced article form can be used 
in conjunction with the numeral 1: 

(48) Gardena um na rama ‘one branch’. 

The number 2 (dCo, dCae) inflected for gender in most languages at an 
earlier stage and still does in some languages: 
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(49) (m./f.) Rum. doUdoud, Sard duosiduas, Ptg. doisiduas, R-R (Miistair) doH dos, 
Ital. dial. (Venice) duldo. 

In Italian the former feminine due has ousted the old masculine and 
‘neuter’ forms dui and dua (the latter survives in Sard). 

In Swiss Rhaeto-Romance varieties ‘neuter" or ‘collective’ forms for both 2 and 
3 (tres, tria) are attested: 

(50) Engadine dua (traia) bratscha, Surselvan dua (trei) bratscha ‘two (three) arms’. 

In some languages numbers between 3 and 10 are less uniformly derived. 

In the Rumanian dialects of the Istrian peninsula, Croatian forms are found 
alongside the Latinate ones, for the numbers 5-10 (for 9 the Latin form has 
apparently not survived anywhere and for 8 and 10 the Croatian form has oust¬ 
ed the Latin one in some varieties): 


Istro-Rumanian numbers 5~S 


5 [cine]/ [pet] 

Daco-Rumanian cinci 

< QUINQUE 

6 [fase]/ [fest] 


< SEX 

7 [fapte]/[sedjTn] 

^pte 

< SEPTEM 

8 [opt]/[osjTn] 

opt 

< OCTO 


For higher numbers Slavonic numerals are preferred in both varieties; but for 
5-10, Southern varieties (which had until recently more contact with North 
Italian dialects) seem to use the Latinate numerals (and other basic vocabulary 
items) more than the Northern do. 

(51) South cine zile / North pet rfm'five days’. 

Both varieties use Latinate numerals for counting domestic animals, but 
Croatian numerals for telling the time: 

(52) Ifapte kali] ‘seven horses’ / [sedpn mi' nutii] ‘seven minutes’. 

Angolar creole, spoken on Sao Tome by some 9,000 descendants of runaway, 
or shipwrecked, slaves, uses Portuguese forms for 1-3, but for higher numbers 
Bantu (Kimbundu) forms are current. 

(ii) Teens 

In Daco-Rumanian, numerals 11-19 are in origin periphrastic (‘one, two, 
etc.-on-ten’): 


(53) 


unsprezece, doisprezece, etc. (reduced in colloquial usage to [unjpe] etc.). 
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It is suggested that although lexically derived from Latin (UNU - super - 
DECEM etc.) these forms are modelled on Slavonic. In Arumanian dialects 
south of the Danube, however, 21-9 are constructed, unlike Slavonic, on a 
similar pattern (umsprdyginti ‘twenty-one’ etc.). 

The late teens in most Romance languages are also periphrastic; 

(54) 17; 10-7: French dix-sept, Cat. dissei, Sard deghesette, 10 ac 7: Ital. diciassette, 

Ptg. dezassette; 10 ET 7; Span, diecisiete, Old Occ. detze e set. 

For 11-15, though, the Latin type undecim etc. survives. For the number 
16 different solutions are chosen: 

(55) French seize, Ital. sedici. Cat. seize, Engadine saidesch < sedecim / Span diez y 
seis, dieciseis, Ptg. dezasseis. 

(iii) Tens 

The Rumanian names for ‘tens’ are periphrastic (doudzeci, treizeci ‘2/3 
tens’ etc.), compared with reflexes of viginti and triginta etc. in other 
languages (Ital. venti, trenta-, French vingt, trente. Span, veinte, treinta). 
Sometimes held to be Celtic in character are such forms as: 

(56) standard French soixante-dix ‘60-10’ (70); quatre-vingt(s) ‘4 x 20(s)’ (80). 

Belgian and Swiss French septante and octante fit better into the general 
Romance pattern. 

(iv) Hundreds 

Rumanian uses a Slavonic sutd ‘hundred’ (Istro-Rumanian sto), instead of 

the CENTUM that survives elsewhere: the Rumanian slang form centd ‘100 
lei note’ may be a French borrowing. 

For the hundreds most languages prefer an analytic construction: 

(57) Rum. doud sute, Ital. duecento. Cat. descents, R-R (Engadine) diatschient, 
French deux cents. Span, doscientos ‘200’ (i.e. ‘2 - lOO(s)’ compared with the 
Latin lexicalized ducenti). 

Nevertheless some less transparent etymological forms survive; 

(58) Sard dughentos, treghentos, Ptg. duzentos, trezentos ‘200,300’ (DUCENTI, TRECEN- 

Tl); Span, quinientos, Ptg. quinhentos ‘500’ (QUINGENTI). 



90 


The similarities 


(b) Basic vocabulary 

If we consider the ‘most representative’ Romance vocabulary of some 
3,000 words (measured by frequency, morphological productivity and 
semantic range), about a fifth of the inherited Latin forms are pan- 
Romance, and many more are shared by several languages. The factors 
that make for similarity and differentiation are discussed in chapters 3,4, 6 
and 7. Here let us illustrate the similarities in ‘basic’ vocabulary by examin¬ 
ing Romance translations of a version of the ‘Swadesh’ word lists, devised 
to provide a practicable means of comparing common vocabulary items 
across languages. One count using a limited range of languages concluded 
that pairwise scores of ‘look-alikes’ range from 70 per cent 
(Italian-Rumanian) to 89 per cent (French-Italian and Spanish- 
Portuguese). Similar counts on Romance creoles reckon that their basic 
vocabulary is often well-nigh identical with that of their lexifier languages. 

Some creoles, however, have incorporated so much vocabulary from 
other sources that they can no longer be called Romance. This is particu¬ 
larly so for Saramaccan, spoken by some 20,000 descendants of runaway 
slaves (maroons) in Surinam. Though some maintain that it was once a 
Portuguese-based creole, it now is reckoned to have only some 37 per cent 
of basic Portuguese lexical items, compared with 54 per cent English, 4 per 
cent Dutch and 4 per cent African. The Romance linguist Schuchardt, one 
of the first to take a serious interest in creoles, in 1914 wrote an article on 
this ‘Bush Negro’ variety, known by its speakers as Djoe-tongo ‘Jew lan¬ 
guage’, presumably because it was allegedly used on plantations owned by 
Jews from Brazil. His information was based largely on reports from 
Herrnhuters (Moravian Brethren missionaries), and he estimated, at that 
time, that the English lexical stock was almost equal to the Portuguese - 
the personal pronouns (except perhaps mini) and the TAM markers (see 
2.2) were already of English origin, but prepositions {com, no) and adverbs 
like aqui were still Portuguese, as were common adjectives like bom, grande 
and pequen{in)o. 

Another ‘maroon’ creole, Angolar in Sao Tome, still has, on the other 
hand, nearly 70 per cent of its basic lexical items cognate with Portuguese. 
Curasao papiamento is also still recognizably Romance even though some 
28-30 per cent of its vocabulary is clearly of Dutch origin, located mainly 
in less ‘basic’ areas of the lexicon, but including, for instance: 

(59) lep ‘lip’, smal ‘narrow’, stui ‘chair’, hel ‘yellow’. 
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Of the 100 words in one version of the Swadesh list the following forty- 
three seem to be pan-Romance - the English gloss appears in alphabetic 
order and apparent Latin etymons are given in the form which seems to 
have survived (nouns are usually from the Latin accusative): 

Pan-Romance basic vocabulary 

all TOTU; and et (but modern Daco-Rumanian < sic ‘thus’); animal animale 
(also BESTIA); blood sangue; bone osse; come venire; die mori[re]; drink 
bibere; ear auricula (though in Spanish oido is more polite than oreja when 
referring to humans); egg ovu; eye oculu; fire focu (for igne) (though words 
for ‘light’ are also used: Span, la lumbre, Sicilian lu luci = ‘fire’ / la luci = ‘light’); 
fish pisce; full plenu; good bonu ; green vir(i)de; horn cornu; I ego (cf 2.2); 
know cognosc[ere] (most languages distinguish ‘know for a fact’ *sapere 
(sapere)/scire from ‘be acquainted with’ cognoscere, though Louisiana cre¬ 
ole has generalized the connaitre form, and Rumanian popular usage substi¬ 
tutes cunoa^te for j/i); leaf foliu (though fronde is also widespread); louse 
PEDUCULU; moon luna (in standard Rumanian the same word is used also for 
‘month’ as in Slavonic); name nomine (in Haiti this word is used only for the 
forename, while the surname is signati - French signature)', new Novu; night 
nocte; not non (rivalled in French and modem Occitan by pas, used in French 
creoles as the main negative particle); one UNU; person persona (in most 
French creoles (di)mun < French (du) monde ‘world, people’, is the usual 
word); rain PLUVIA (the word for ‘water’ aqua is used more often in Sardinia 
and southern Italy); round ROTUNDU; see VIDERE; skin pelle; sleep dormire; 
smoke FUMU; star Stella; stone petra (in Haitian creole \pie] < pierre is spe¬ 
cialized in the sense of ‘fetish’ and roche ‘rock’ is used more generally); thou TU 
(see 2.2); tongue LINGUA; tooth dente; two duo; water aqua; we nos (see 2.2); 
what QUID. 

In seven cases different derivatives or related forms are used but it is not 
difficult to discern a relationship: 

Related basic lexical items in Romance 

ashes cinere, Ital. cenere, French cendre, R-R (Engadine) cendra / cinisia 
Dalmatian (Veglia) kanaisa, Sard kijina. Span, ceniza, Ptg. cinza, Rum. cenu^d 
(scrum ‘cold ashes’ is similar to words in Bulgarian and Albanian); fly volare 
French voler I exvolare Rum. zburd ; sit sedere French ( as)seoir / sedentare 
Span, sentar (Haiti [f ita] seems to come from the French imperative assied-toi, 
Mauritius [asize] is clearly formed from the French past participle assis - cf 
[debute] ‘to stand up’ < debout ‘standing’); sun sole Ital. sole i soliculu 
French soleib, this istu/ecce-istu and that illu/ecce-illu; who qui French qui, 
Ital. chi I QUEM Span, quien, Ptg. quern. Rum. cine. 
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Sometimes Rumanian is the odd-man-out: 

Basic lexical differences in Rumanian 

cold FRiGiDU: Rum. rece recens ‘fresh’ (cf. Old French roisent; the Rumanian 
noun frig ‘coldness’ is from frigus, frigoris); dry siccu: Rum uscat exsucatu 
(cf. Ital. asciugare ‘to dry’ etc.; Rum. sec is less popular); earth terra: although 
{ara = ‘land’ in Daco-Rumanian (also = ‘earth’ in Arumanian), earth is rendered 
in the standard language by pamint pavimentu (cf Sard pamentu ‘floor, ground’); 
foot pede: but Rum. picior < peciolu ‘stalk’ cf Ital picciuolo ‘petiole’); heart 
cor: Rum. in/md anima ‘spirit, soul’ (cf French dme); man homine: Rum bdrbat 
BARBATU ‘bearded one’ / om ‘human being’ (in French creoles the equivalents of 
the French word negre or bougre are often used to designate a male human); 
mouth BUCCA (for os, ORis): Rum bucd keeps the Latin meaning ‘cheek’ while 
‘mouth’ is rendered by gura{avLA ‘throat’, but cf French gueule ‘jaw’, used 
slangily and in creoles for ‘mouth’); tree ARBORe: although in Old Daco- 
Rumanian and in Arumanian arbure = ‘tree’ (also Megleno-Rumanian arbur 
= ‘oak’), an Albanian (or Thraco-Dacian) word copac is the usual Daco- 
Rumanian general word, while pom (POMU) means ‘fruit-tree’ (in many parts 
of Italy PLANTA ‘plant, sole of the foot’ is used for ‘tree’ while Haitian Creole 
uses [pje] < French pied‘foot’ and in the Engadine bosch, of Germanic origin 
(cf French bois ‘wood’) is preferred to alber). 

It will be seen that the deviations in Rumanian seem to stem mainly 
from a differentiation in Latin vocabulary rather than borrowing from 
other languages. As far as ‘basic’ vocabulary is concerned this is true for 
many of the lexical differences between the modern Romance languages, 
as we shall see later. Not infrequently similar differences are found 
between neighbouring and mutually intelligible dialects; indeed it can be 
claimed that speakers in one small community may even, for normal use, 
select, from the possible synonyms, those items that are not used in a near¬ 
by village. In this way, the community can emphasize its separate identity, 
without impairing bread-and-butter communication (see 5.4). 

Differences in basic vocabulary between creoles and ‘core’ languages 
seem often to be the consequence of semantic shifts that may be connected 
with environmental and sociocultural conditions. French creoles, for 
instance, often use more archaic or popular words than the standard; 

(60) ba(J) < bailler for donner ‘give’, guetter ‘to watch out for’ for regarder ‘to look 

at’, rete < rester ‘stay’ for habiter ‘inhabit’. 

Originally nautical words are also generalized - like amarrer ‘to moor a 
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ship’, chavirer ‘capsize’ and so on. This sort of generalization is found in 
other Romance languages: 

(61) *ADRIPARE ‘to come to shore’ > Ital. arrivare, Sard arribare, French arriver. 

Cat., Occ. arribar ‘to arrive’, plicare ‘to fold’ (of sails) > Span. Uegar, Ptg. 
cliegar ‘to arrive’. Rum. apleca ‘to leave’. 


2.5 LOANWORDS 

Borrowing from other languages is mainly a factor in the differentiation of 
Romance lexicon, which will be discussed in chapters 6 and 8. The stan¬ 
dard languages have had in common, from late medieval times, the tenden¬ 
cy to borrow more extensively from Latin (see 3.4) and from other 
Romance languages than from non-Romance tongues, while dialects tend 
to borrow from their national language (see 4.8). 

Isolated Romance-speaking communities are clearly under pressure to use 
non-Romance forms: an extreme example are the tiny Istro-Rumanian groups 
that have at their disposal a number of synonymous pairs of Rumanian and 
Croatian words, especially in verb aspectual pairs in which a Romance form is 
used for the imperfective, and the Slavonic for the perfective: 

(62) be (bibere) / popii ‘to drink’; munkd (’"manducare) / poidii ‘to eat’. 

Similarly papiamento in Curasao often alternates between equivalent items 
of Romance (Spanish/Portuguese) and Dutch origin in its vocabulary - see 
example (59) above. Creoles have to a differing extent borrowed from African 
or Asian languages. We may compare, in this respect. Cape Verde with Sao 
Tome Portuguese creoles, with the latter admitting many more African words: 

(63) ‘body’ Sao Tome ubwe I Cape Verde korpe (Ptg. corpo)\ ‘empty’ Sao Tome duji I 
Cape Verde baziu (Ptg. vazio). 

Massive borrowing from neighbouring languages can be a sign of immi¬ 
nent ‘language death’, as speakers shift their loyalty from one language to 
another: this is presumably what is happening to Istro-Rumanian, and to 
many Asian Portuguese, and Philippine Spanish varieties. The 
Saramaccan and papiamento examples are of different character, involving 
the complex, and not yet fully understood, process of creolization, which I 
shall return to in chapter 5. 
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2.6 HOW CLOSE ARE ROMANCE LEXICONS? 

Although many of the common lexical items in the Romance languages 
are cognate, they may have very different phonetic shapes in different 
languages. Most of the studies on divergence between languages are con¬ 
cerned principally with the phonetic changes that have cleft the languages 
apart. But in many cases it is fairly easy to formulate translation rules 
which allow speakers to recognize their own indigenous items in the unfa¬ 
miliar shapes of other languages. For instance, the voicing and opening of 
intervocalic voiceless stops in Spanish, compared with their preservation 
in Italian, causes little hindrance to recognition when the regular corre¬ 
spondence is discerned: 

(64) [bida] / [vita] ‘life’; [amiyo] / [amiko] ‘friend’. 

More difficulty is caused by the complete disappearance of these stops in 
modern French, so that there is no way of discovering which stop is missing: 

(65) [vi], [ami]. 

However, the difficulty is reduced by the existence of ‘learned borrowings’ 
which are semantically and phonetically related to the ‘popular’ words, but 
which still preserve the original stops: 

(66) [vital] ‘vital’, [amikal] ‘friendly’. 

In SO far as speakers link [vi] with [vital], for instance, it is possible that 
they are conscious, synchronically, of the type of close resemblance 
between the lexical items of different Romance languages that would oth¬ 
erwise have been rent asunder by diverse phonological developments. 

Thus one may make bold to suggest that, from the late medieval period, 
the predilection for Latinisms has closed some of the gaps that were 
opened in the Dark Ages (see 3.4). Latinisms are particularly frequent in 
the fundamental vocabulary of Italian, Spanish, French and Portuguese 
(in that order), and least frequent in Sard and Rumanian. If we take into 
account both inherited and borrowed Latin vocabulary, the most 
‘Romance’ of the languages is Italian, and the least, Rumanian. 

2.7 ROMANCENESS AS A CONTINUUM? 

Latinizing tendencies, especially in lexicon, but also in grammar are a 
mark of literary languages that form part of the Romance ‘club’ - of which 
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Rumanian is a fairly recently joined member - and trickle down to a 
greater or lesser extent to the varieties that shelter under the ‘roof’ of these 
languages. We shall see in the next chapter that this ‘club’ of languages 
also shares certain features that are absent from Latin. One answer to the 
question we asked originally - ‘What is a Romance language?’ - could be 
answered by reference to the ‘club-land’ metaphor, which I shall develop 
further in chapter 4. A Romance language is one whose speakers have a 
representation of their language that associates it with prestigious ‘Neo- 
Latin’ club members, and especially with Latin itself As speakers of such 
varieties that lack the status associated with the nation-state - like 
Catalan, Occitan, Galician, Romansh, Sard - seek to nurture self-esteem, 
they will usually assert their independent affiliation to Latin, as full mem¬ 
bers of the club and not merely satellites of the ‘big’ languages. Other vari¬ 
eties - like some of the creoles - seek their cultural identity elsewhere. 
Others are doomed to decline and death as speakers disappear or shift 
their allegiance to other languages. This is happening now to Asian 
Portuguese, to Louisiana French, to Judeo-Spanish, to Istro-Rumanian 
varieties, as it has in the past to the Romance varieties that were certainly 
spoken in Dalmatia and in some areas that are now German-speaking, 
and to those probably spoken in North Africa and Britain. 

But, of course, not just any language can join the club. It would have 
first to be related to the Romance family. This is a classic metaphor in 
comparative philology, which refers to a group of languages that are relat¬ 
ed, in the sense that they are modern variants of what was in the past a sin¬ 
gle language. The (inexact) analogy, with a parent and her children, 
reflects the view of languages as organisms that are born, grow old and die. 

Here I have used lexico-phonetic correspondences among basic lexical 
items and grammatical morphemes as the bench-mark for family member¬ 
ship. Where to draw the line is of course an arbitrary decision: perhaps a 
language with more than 50 per cent of core items that are recognizably 
Romance can be included. This would exclude English (even though in the 
wider vocabulary numerous French and Latin words are used) and 
Saramaccan. It would include most other ‘Romance creoles’, even 
Angolar, and, still (but for how long?) Istro-Rumanian. 

Within the family there would still be more or less distant relatives; I 
have suggested that Italian seems to be at the centre of the family group, 
with Rumanian and French (including French creoles) more on the 
periphery. Rumanian, unattested before the sixteenth century, seems to 
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have been drawn further into the family since the eighteenth century, when 
its more prestigious speakers began to solicit for membership of the ‘club’, 
by favouring Romance lexical variants at the expense of Slavonic, Turkish, 
Albanian or Greek. 

I have broached, without resolving it, however, the question of a 
Romance ‘type’, which would be defined by synchronic morphosyntactic, 
rather than by sociocultural or historical, criteria. I shall pay more atten¬ 
tion to sociocultural factors in chapter 8, while in chapter 5 I discuss cre- 
olization as a historical process. 

One thing I think I have demonstrated is that, at the extreme, creoles can 
have a different sort of verbal system from other Romance languages, even 
though the ‘Romance verb’ and the ‘creole verb’ themselves show a great 
deal of variation and sometimes overlap. I have also maintained that a 
nominal two-gender system, as distinct from the Latin three-gender sys¬ 
tem, is a Romance characteristic, not found in any of the creoles. 

Most, but not all, of the common features of Romance are inherited 
from Latin. In the next chapter I shall look more closely at the relationship 
between Latin and Romance, emphasizing the differences as well as the 
similarities. In chapter 4,1 shall consider especially features that are shared 
by most Romance languages but absent from Latin. 


FURTHER READING 

On ‘family’ relationships and reconstruction, Polome & Winter eds, (1992). On 
creoles. Holm (1988-9). Volume I of Posner & Green eds. (1980- 93) gives an 
account of lexicological studies in Romance, with a complete bibliography up 
to that time. More recent important comparative works include Sala ed. 

(1988), a detailed study of the ‘representative’ vocabulary of nine Romance 
languages and of Latin, and comparison between them. Vemay (1991-), of 
which four volumes have appeared to date (1996), is less a dictionary than an 
account of Romance words for selected concepts grouped according to their 
semantic field (‘moral concepts’, ‘parts of the body’, etc.). On word formation, 
see Kursschildgen (1983), Klingebiel (1989), Malkiel (1989). On etymology, see 
especially Malkiel (1992,1993). Most comparative works concentrate on dif¬ 
ferentiation of vocabulary; for instance Rohlfs (1986). On -mente, Karlsson 
(1981). 
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3.1 DERIVATION 

I asked in the last chapter ‘What is a Romance language?’ For most 
Romanists, the answer is obvious: ‘It is a language derived from Latin.’ 
But what is meant by derivation is not obvious. As a metaphor, based on 
the Latinate form de-rivus ‘from a stream’ (implying originally ‘diver¬ 
sion of a water channel’), we can conjure up a picture of a river flowing 
from its source, expanding as it is fed from various other sources and split¬ 
ting up into an estuary as it nears the sea. An even more vivid metaphor is 
that of a tree, from the trunk of which various branches sprout. But in 
both these cases the parent remains, with the off-shoots still part of it. The 
comforting metaphor of a family, which I used before, is misleading too, 
because that implies discontinuity and the birth and death of discrete indi¬ 
viduals. 

What we are looking for is a metaphor that implies continuity and 
diversification, but also the slow withering away of the parent stock: my 
far from glamorous suggestion is that of vegetative reproduction, as with 
a rhizome. The Romance languages, the newer sprouts, are living versions 
of Latin, a root-stock that is now dead - in the sense that it is now repre¬ 
sented only by a closed corpus of texts and that it is no longer the mother- 
tongue of any speaker. How and when the death of Latin came about we 
shall discuss later. 

I have already emphasized that traditional comparative philology 
assumes that distinct languages which share a large number of basic lexi¬ 
cal elements, the phonological shapes of which are related in a regular 
way, must at an earlier period have been one and the same language. The 
methods developed for reconstruction of the original proto-language are, 
as we have seen, vindicated by the results of Romance reconstructions. 
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which often closely match the actually attested Latin forms. But it is also 
true that some shared features of the Romance languages are not attested 
in Latin; hence the disputes about how reconstructed Proto-Romance, or 
Vulgar Latin, relates to the language of our Classical texts. 

3.2 PROTO-ROMANCE OR VULGAR LATIN 

The much-used term Vulgar Latin fills some Romanists with consterna¬ 
tion, because it is used in many vague, and sometimes contradictory, ways: 
one commentator counts thirteen distinct meanings for the term. It 
implies for most users the Latin of ‘the people’, unsullied by excessive 
grammatical learning, lively, and expressive, but, perhaps, slovenly, unsub- 
tle and imprecise. Recall (page 3) that some sixteenth-century thinkers 
claimed that even in Republican times the Roman populace may have spo¬ 
ken Italian. They did not envisage that languages change over time, merely 
that one social variant replaces another, when there is decay of social con¬ 
straints. The idea that change implies decay from God-given complexity 
and uniqueness towards animal-like communicative simplicity survived 
well into the era of scientific linguistics. Classical Latin would, in the view 
adopted by most respected nineteenth-century linguists, be maintained 
only by discipline and educational rigour, while a relaxing of standards 
gave rise to a less intellectually subtle, but perhaps more crudely expres¬ 
sive, vulgar version of the language. Thus the sources of Romance were to 
be sought in scantily attested colloquial Latin. It was well known that 
social differences in Republican Rome were stark and it has even been sug¬ 
gested that the non-senatorial class had virtually no access to the prestige 
language that was committed to writing. 

The notion that the proto-language of Romance could be reconstructed 
in a way similar to that used for unattested Proto-Germanic or Indo- 
European did not catch on until quite late in the nineteenth century. Only 
more recently has much attention been paid to the discrepancies between 
the sparse textual evidence for colloquial Latin and the reconstructed 
Proto-Romance. 

(a) Popular Latin 

It could not fail to strike observers that early Latin and Italic dialects dis¬ 
played some of the features that resurfaced in the late texts that foreshad- 
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owed Romance. Let us take as an example syncope, a term used for amply 
attested Late Latin truncation of a penultimate syllable containing a short 
vowel (I or u), in classical forms in which the accent falls, exceptionally, 
on the antepenultimate syllable: 

(1) vlRDE for vIr!de(m) ‘green’ (cf. Ital. wrde etc.); AURICLU for 
aurIcOlu(s) ‘ear’ (cf. Ital. orecchio etc.). 

Such truncation is usually associated with the postulated change from 
tonal to stress accentuation in later Latin: unaccented vowels tend to 
reduce and disappear between two stressed syllables, whereas the contrast 
between high-tone and low-tone vowels would not be expected to produce 
the same effect. There is no universal tendency to reduce and drop vowels 
in atonic syllables, so perhaps in Latin the vowels written I and C were 
schwa-type vowels that were in some sense evanescent. 

Evidence suggests that early Latin may also have had stress accentua¬ 
tion: certainly there are examples of apparently syncopated forms from 
this period: 

(2) CALDUS, for CALlDUS ‘hot’ (cf Ital. caldo). 

The Emperor Augustus is said to have regarded the unsyncopated form of this 
word as affected. 

Some scholars believe that the tonal accent, described by Latin grammari¬ 
ans, may in fact have been an artefact of the literary language, imitated 
from prestigious Greek. Some of the short vowels found in classical penul¬ 
timate syllables may then be a consequence of insertion (anaptyxis) 
between certain consonant sequences, perhaps favoured by fashionable 
‘tonal’ pronunciation. In this case it is likely that it was never totally 
adopted in the colloquial language. A parallel can be drawn with later 
developments in Romance. In standard Italian, for instance, unsyncopated 
forms apparently survive (see 7.2): 

(3) Compare Ital. tavola with French table < tabOla; tabla in the Appendix Probi 
(note that bl was a very infrequent cluster in Latin). 

Here we seem to be faced with ‘semi-leamed’ forms; ‘regular’ counterparts 
would derive from a form like tabla, *tavla, *taula, which ends up as: 

(4) Ital. dial, tola, tabia in various specialized uses like ‘shoemaker’s bench, chess¬ 
board, etc.’, French tdle ‘sheet metal’. 
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Compare also: 

(5) modem Ital. anima ‘spirit, soul’, Rum. tnima ‘heart’ / Old Ital. ahna, anna, 
French dme (old arme). Span., Ptg. ahna < anIma. 

So-called sdrucciole, with antepenultimate stress, are fairly common in 
Italian, not only in learned words. There is often hesitation about the cor¬ 
rect pronunciation, with, on the whole, avoidance of accentuation of suf- 
fix-like sequences like -ola, -ula. Evidence that there has been restoration, 
rather than retention, of the atonic penultimate is provided by the not 
infrequent lack of correspondence between the vowel and that of the 
Latin etymon: 

(6) attimo ‘moment’< atomu; debole ‘weak’< debile; sedano ‘celery’< selinon. 

The suggestion that lower class Latin continued to use syncopated forms 
throughout the classical period is supported by evidence from graffiti from 
Pompeii (which was destroyed by an eruption of Vesuvius in AD 79): 

(7) DOMNUS for DOMtNUS ‘master’; coliclo for caulIculus ‘little cabbage’ (also 
with ‘popular’ reduction of diphthong au to o); manuplus for mAnIpOlus 
‘ bundle’. 

There was also abundant attestation in later texts. 

In some sequences the short vowel in the penultimate syllable seems to have 
distinctive value: 

(8) MANCUS ‘maimed’ > Ital., Span., Ptg., Cat. manco. Old French, Occ. mane / 
MANiCUS ‘grip’ > Ital. manico. Span., Ptg. mango, French manche, Occ. morgue. 

One type of social variation is obvious to all: different choice of vocabu¬ 
lary. In many cases the Latin word that survives into Romance is more 
popular in its flavour than the word preferred in high literary style. Among 
common words we can cite: 

Pan-Romance popular Latin words 

BELLUS ‘fine, beautiful’ which survives in all Romance languages, although in Spanish, 

Portugese and Sard it is probably a loanword from Italian. Pulcher is quite 
lost, while FORMOSUS ‘beautifully formed’ persists in Spanish, Portuguese and 
Rumanian. 

CABALLUS ‘horse’, originally a Celtic word, replaces equus everywhere. 

Os ‘mouth’ is replaced by bOcca ‘puffed-up cheek’, or gCla ‘gullet’. 

FLERE ‘to weep’ gives way to plorare ‘to lament, to wail’, or plangere ‘to bewail’. 
ludEre ‘to play’ is replaced by iocare ‘to joke’. 
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PORT ARE ‘to carry’ - a word already used by the early comedian Plautus - replaces ferre ‘to 
bear’. 

Edere ‘to eat’ survives in derived forms like Span, comer; some other languages use a verb 
formed on the name of a gluttonous character of comedy - mandCcAre. 

The popular idiom, and Romance, often preferred longer, derived forms 
to a shorter word. Diminutives were particularly favoured: 

(9) AURICULA, AViCELLA, genOculum ‘little ear, bird, knee’. 

So were frequentatives: 

(10) CantAre ‘to keep on singing’ for canEre ‘to sing’. 

(b) Italic dialect 

It can be argued that the differences between reconstructed Proto- 
Romance and Classical Latin are great enough for us to postulate another, 
related language as the ‘mother’ of the Romance family. Some, for 
instance, postulate a variety that deviated from Latin at an early stage, 
perhaps before the third century BC, when the earliest literary texts were 
written. Others, more radically, suggest that a regional dialect (probably 
from southern Italy) was at the origin of the spoken language that spread 
through the Empire. For instance, reconstruction of the medial cluster in: 

(11) French mil, Ital. nolle. Span, noche. Rum. noaple ‘night’ 

which are usually derived from nocte, causes difficulty, because of the 
deviant Rumanian reflex. A plausible proto-form would have /k'^t/; the QU 
/k'^/ of Latin, as in aqua ‘water’, comes out in Rumanian like the second 
labial element: 

(12) Knm.apdHiaX.acqua. 

Note that /k"t/ and /kt/ could be related in a similar way as literary -cOl- and 
syncopated -cl-. It may also be that Latin QU represented a cluster /kw/ rather 
than a unitary phoneme. 

Even more controversial is the suggestion that the Romance mother may 
have been an umlauting (metaphonic) language, in which a final I or u raised 
the vowel of the preceding tonic syllable. Such umlauting is found in 
various Romance dialects, particularly in central and southern Italy. The 
‘mass neuter’ forms of Asturias and Abruzzo (see 1.4) can be connected 
with the same phenomenon. There are traces too of umlauting in French: 
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(13) fis < FEcr ‘I did’; il < *illT ‘he’. 

More widespread is the diphthongization that some see as a eonsequenee 
originally of umlaut (see 4.2). 

It has been claimed that Proto-Romance can best be reconstructed as an 
umlauting language, like Germanic languages. As Latin, even in its popu¬ 
lar varieties, shows no signs of umlaut it could not therefore be equated 
with the Romance mother-language. 


(c) Late Latin 

It is generally assumed that after the Latin Golden Age (c. 90 BC to AD 
14), as social structures became less rigid and linguistic habits became 
more variable with the spread of the Empire, there was relaxation of the 
tight control over the written language that had characterized the classical 
period. Even those who do not subscribe to the view that popular Latin 
always differed considerably from the prestigious variety recognize that 
the later Latin texts display certain features that foreshadow Romance. 

Petronius, who died in AD 66, is often cited as one of the heralds of the 
Romance languages. Elis depiction of vulgar feasting in Cena 
Trimalchionis includes well-known ‘Vulgar Latin’ features, such as syn¬ 
cope: 

(14) corpus. OCLUS. cai.dus, oricla, etc. 
and ‘popular’ vocabulary: 

(15) BELLUS, PLORARE, BUCCA, CABALLUS, MANDUCARE, CtC. 

The letters of Claudius Terentianus from the first quarter of the second 
century AD - personal letters, written in Egypt by professional scribes 
(both in Greek and in Latin) - are recently studied texts among the many 
papyrus documents now available. They show even more evidence of 
‘Vulgar’ features. 

The best-known early ‘Vulgar text’ is the Appendix Probi, a curious list 
of correct and incorrect spellings forming part of a seventh- or eighth- 
century manuscript. Its date and place of composition is disputed. If 
indeed it is from the third century, it provides ample mainly phonological 
evidence, especially of syncope: 


( 16 ) 


VIRDIS, TABLA, OCLUS etc. 
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of jodization: 

(17) ALIUM NON ALBUM, COCHLEA NON COCHLIA, etc. 

and of loss of final -m; 

(18) OLIM NON OLI, PASSIM NON PASSI. 

It is disappointingly inconclusive on some other changes that we think 
should have happened by then. One problem about the text is that we do 
not know what its purpose could have been: a school exercise? A consulta¬ 
tive manual for scribes? Certainly the choice of words in the list is eccen¬ 
tric and does not always reflect Romance outcomes. 

Another text that is given favoured status in the study of Vulgar Latin is 
the so-called Peregrinatio Aetheriae (or Egeriae), which we have in an 
incomplete eleventh-century manuscript. It is a nun’s first-hand account of 
a pilgrimage to the Holy Land, dated, on external evidence, to between 
AD 363 and 431. The value of this text is that it provides evidence of syn¬ 
tactic changes, and in particular of the comparative infrequency of the 
verb-last ordering that characterized Classical Latin. The Terentianus let¬ 
ters had also shown a similar trend: in all the Romance languages a SVO 
(subject-verb-object) ordering is seen as basic (but for preposing of object 
clitics, see 4.4). 

In the Late Latin period there were numerous grammatical treatises, 
presumably aimed at speakers who were conscious of their imperfect com¬ 
mand of Latin. Much of the evidence is ambiguous, but some does bear 
out the predictions suggested by our reconstruction of Proto-Romance. 

Texts and inscriptions in Latin continued to be produced throughout 
the Dark Ages. The ‘mistakes’ in these texts can be interpreted to match 
the changes that were to produce Romance, and as time goes on they tend 
to become more numerous. But clearly the writers were versed enough in 
the Latin tradition to make a shot at correctness, and they continued to 
use forms - like the Latin passive (see 4.7) and future (see 4.6) - which dis¬ 
appeared completely from Romance. 

How far can we guess what sort of language they were speaking? What 
we cannot say for certain is that the Latin of these texts represents the 
‘same language’ as reconstructed Proto-Romance. 
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3.3 DIFFERENCES BETWEEN LATIN AND 

ROMANCE 

(a) Language names 

The use of the collective term Romance has always been seen as indicative 
of the separate status of these languages. Latin is the name traditionally 
used for the literary language known from classical texts, as well as for 
Medieval Latin, and for Neo-Latin, as the Renaissance use of the language 
is designated, and for Church Latin. The name derives from the ancient 
peoples of Latium, but the standard and preferred form was always the 
usage of Rome. In ancient texts lingua romana seems to have been used 
synonymously with lingua latina, though often with the suggestion that 
the urban use was more sophisticated than the regional. 

For Classical Latin, the Romance languages now use, like English, 
Latinized forms of the name. These are learned borrowings that date, in 
French, from the twelfth century, and in Spanish probably from the fif¬ 
teenth or sixteenth centuries. Standard Italian latino could be a popular 
form, but Latin was known as litterae in the twelfth century, and gram- 
matica in the thirteenth (both implying a written language). In Rumanian 
there is no popular reflex of latinus. 

Popular reflexes, typically ladinio), are today used for some Romance 
languages, all in contact with non-Romance tongues: the now-extinct 
Dalmatian language, the written language used for literal translation from 
Hebrew and Aramaic texts by Sephardic Jews (whose vernacular is a 
Spanish variety), and Engadine and Dolomite Rhaeto-Romance. This 
originally bookish usage probably dates from the nineteenth century, but 
earlier similar names were used for many North Italian varieties. Till 
about the seventeenth century, the word was used, as it is today in the 
United States, for Spanish-speaking foreigners. 

Everyday words deriving from LATINUS in the Romance languages are frequent 
in the general sense ‘language’ (including for instance ‘birdsong’, also for ‘gib¬ 
berish’) and, as an adjective, with meanings like ‘clever’, ‘smooth’, ‘slippery’, 
‘nimble’, ‘easy’. Non-Romance speakers often use the name for Roman 
Catholics: in some Germanic dialects it means ‘bondsman’. Spanish Arabic 
used the name for Christians. 

One commonly held belief is that in the later part of the first century AD 
the names lingua latina and lingua romana were differentiated, with 
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the former indicating the written, learned language, and the latter one or 
several spoken Romance tongues that are not recorded in writing. It has 
been shown, however, that this distinction is not made in texts before the 
ninth century. Even then it is not wholly consistently made, though the 
addition of an adjective like rustica sometimes makes clear that lingua 
ROMANA is meant to designate vernacular use. 

Romance speakers throughout the Middle Ages tended to differentiate 
themselves from Germans or Slavs by calling themselves Romans; the 
Germans called them by varieties of the name Welsh (see, for instance, 
Walloon in Belgium, old Churwdlsch for Romansh or Surselvan, and Vlach 
for Rumanian). Today the Roman name survives only in regions where the 
language is in contact with other tongues - in Suisse Romande, in 
Rumania, in Romansh territories, and in sefardi Jewish communities. 

Names for the individual languages usually arose in the Middle Ages, fre¬ 
quently with a geographical or historical origin. Francisca seems originally 
to have been used for Germanic Frankish; for French romanz was still in use at 
the beginning of the twelfth century, to be replaced only later by franceis. In 
Occitan romans was usual, with lemosi and provenzal fairly rarely used before 
the thirteenth century. Langue d’oc seems to be a northern name, used from 
the late thirteenth century, and Occitan a modern usage. In Spain romance was 
the usual name before the second half of the thirteenth century, when Castel¬ 
lano (belonging to the ‘castle area’) began to be used. Espanol (formed on His- 
pania) dates from the end of the fifteenth century. In the west romango was 
current from the fourteenth century, portugues, from the sixteenth century. 
Catald dates from the fourteenth century; romanf was usual in the thirteenth 
century. In Italy, the vernacular was called volgare, though the name italiano 
was introduced first in the thirteenth century. For Rumanian we have little evi¬ 
dence before the sixteenth century. It is claimed that the name Vlach was not 
used by speakers themselves, who always called themselves Romans. 


(b) Phonology 

The differences between the Romance languages are discernible most at 
the phonological level, and it is on this aspect that traditional Romance 
linguistics has concentrated. Where Romance languages share phonologi¬ 
cal characteristics that are absent from Latin, it is usual to look to the 
proto-language for their source. We shall here examine some phonological 
developments that may date from a common Romance period. 
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(i) Vowel length 

The Latin phonemic vowel system comprised two inventories, one long 
(two moras) and one short (one mora). The long-vowel inventory had, 
besides the five vowels matched in the short system (i, u, E, o, a), three 
diphthongs. In Romance the diphthongs oe and ae were merged into the 
long and short e respectively. The diphthong AU, which in popular Latin 
usage seems to have merged with o: 

(19) CAUDA > CODA > French queue, Occ. coza 

is retained in some Romance languages, but merges eventually with 6 in 
most: 

(20) CAUSA > French chose, Occ. cauza. 

One salient difference between Latin and Romance is that the latter inher¬ 
its no phonological length distinctions from Latin (where lexically deter¬ 
mined long vowels were fixed by poetic authority, and metrical long vowels 
‘by position’). Even though it was never consistently marked in graphy, 
vowel length was indubitably distinctive in Latin, with minimal pairs like; 

Long and short vowels in Latin 


/i:/ 

PfLUM 

‘javelin’ 

III 

PiLUM 

‘hair’ (acc. sg.); 

/e:/ 

LEVIS 

‘smooth’ 

lei 

l6vis 

‘light’; 

/ai/ 

MALUM 

’apple’ 

lal 

mAlum 

‘evil’ (acc. sg.); 

loll 

5s 

‘mouth’ 

lol 

Os 

‘bone’; 

/ui/ 

FURIS 

‘of a thief’ 

lul 

fOris 

‘you rage’. 


Admittedly there seems to have been some morphological relationship 
between short and long vowels: 

(21) FiDES ‘faith’ / FiDES ‘you will trust’; sCo ‘I sew’ / sutor ‘cobbler’; attrActus 
‘attracted’/ ATTRAXi ‘I attracted’. 

There was also some interchange between long vowels and long conso¬ 
nants (geminates): 

(22) mucus/mOccus ‘mucus’, totus/tOttus ‘air, baca/bacca ‘berry’. 

Combinations of long vowel and long consonant are extremely rare in Latin: 
apart from derived forms, only stella ‘star’ and villa are usually cited. In the 
former case, the geminate is supported by other Indo-European evidence, 
which suggests assimilation from an earlier cluster. In Latin such clusters are 
usually simplified after a long vowel: 
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(23) mEd-cum > MECUM ‘with me’. 

Although Span, estrella and Ital. Stella imply a Latin geminate, the diphthong 
of French etoile indicates a form with single 1. 

How far the quantity differences were consistently functional in Latin has 
been debated. In atonic syllables, for instance, short vowels predominate. I 
have already suggested a tendency to schwa-like reduction. In tonic sylla¬ 
bles, too, fi, I and 0 are frequent. 

That the long and short vowels were acoustically similar is evidenced by 
loanwords - for instance, in Basque - which do not distinguish them. 
Moreover in Sard dialects long and short vowels fall together; this is also 
true of some of the vowels in Southern Italian dialects and Rumanian. 
Most languages, however, distinguish inherited Latin vowels by quality 
and not by quantity. Thus, inherited short vowels tend to have lower pro¬ 
nunciation than long vowels: E, for instance, would be [e], whereas fi is [e]. 
It is often assumed that colloquial Latin at some period must have had 
similar qualitative differences, which came to displace the redundant 
quantitative differences. 

It is from Romance data impossible to reconstruct the fact that Latin 
had distinctive vowel length. Vowel length differences may have been 
neglected in rapid speech, especially in varieties with a stress accent, and 
were therefore frequently accompanied or supplanted by qualitative dif¬ 
ferences. 

It has been suggested that differences in accent placement can be used to recon¬ 
struct Latin vowel quantity. However, this would depend on reconstruction of 
short vowels in penultimate syllables - which, we have seen, were probably regu¬ 
larly syncopated (though sometimes apparently restored). In any case accent 
placement in Latin depended not on vowel length but on syllable weight. 

Vowel reconstruction seems clearly to distinguish West Romance from 
East Romance (see 5.2). In the former E falls together with I, and o with 0: 

(24) credit ‘he believes’ > /krede/, pIra ‘pear’ > IperaJ; flOrem ‘flower’ > /(lore/, 
mCsca ‘fly’> /mosca/. 

The differences in treatment of vowels are linked by many Romanists to 
historically attested political separation of some areas from the Roman 
orbit: thus Sardinia was isolated from the second century, Rumania from 
the third. It is therefore assumed that the specifically Western developments 
- attested in France, the Iberian peninsula and northern and central Italy 
(and, incidentally, in Britain) - must have occurred at a later stage. 
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Diphthongization in tonic syllables in early Romance is connected with sylla¬ 
ble structure and is thought by some to result from lengthening of vowels in 
open syllables (see 4.2). This process seems to have no connection with Latin 
vowel length. 

Compensatory lengthening, by which merger of two phonological seg¬ 
ments results in one long one, was at the origin of many of the Latin long 
vowels. Similar processes took place later in separate Romance languages, 
so that some developed vowel length differences at some stage during their 
history. For instance, in the course of Old French the loss of medial sylla¬ 
ble-final consonants led to the creation of distinctions like these (where 
the circumflex accent marks a long vowel): 

(25) pate < PASTA ‘paste’ / patte ‘paw’ (probably of onomatopoeic origin); bite < 
best(i)a ‘beast’ / bette < beta ‘beetroot’. 

In most varieties of French these length distinctions have now been lost, 
(ii) Accent 

There is some dispute, as we have seen, about the nature of the Latin 
accent. Certainly in all the Romance languages the accent is one of stress 
(although some creoles seem to have tonal characteristics, like neighbour¬ 
ing African languages). 

The position of the accent in Latin was determined by morpheme and 
syllable structure, and was not distinctive. Within the word (not including 
any clitics, but embracing also affixes) the accent fell on the penultimate 
syllable, except when that syllable was light - that is, eontained a short 
vowel not followed by a tautosyllabic consonant. 

In Jakobson’s formulation, the accent falls upon that syllable which contains 
the second mora counted from the final syllable (where a short vowel has one 
mora, and a long vowel, two). 

The regularity of accent placement relative to syllabic structure is 
demonstrated by the way Latin displaces the accent of Greek loanwords 
to conform to its own pattern: 

(26) kdmelos > camelus ; kamdra > camEra 

In nearly every case the stress falls on the same syllable in Romance as it 
did in the Latin etymon. Occasionally, especially where the word was 
proparoxytonic, Romance has changed the stress pattern. This sometimes 
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happens as a result of lengthening the short vowel, so that the penultimate 
syllable becomes heavy: 

(27) *SAPERE for SAPfiRE ‘to taste, to know’: Ital. sapere. Span., Ptg. saber, French 
savoir, Catalan saber (but also sebre). 

One group of words fairly consistently shows displacement of accent: 
these are proparoxytones with a penultimate short vowel followed by an 
obstruent + r cluster. In Latin such clusters did not ‘make position’ - so 
that the preceding syllable was not heavy. In popular Romance reflexes, 
however, the accent falls on the penultimate syllable, and not, as in Latin, 
on the antepenultimate: 

(28) AlAcrem ‘lively’: Ital. allegro. Old French halaigre (< *alecrem); colCbra 
‘female snake’: French couleuvre. Span, culebra (but Ptg. cobra perhaps from 
the proparoxytonic form); iNTfiORUM ‘whole’: Rum. intreg, French entier (but 
also Old French entre), Ital. intero. Old Span, intrego (modem entero); cath£dra 
‘chair’: Old French chaiere (modem chaire, chaise). Span, cadera ‘hip’ (Span. 
cdtedra, Ital. cattedra are Latinisms). 

As a result of syncope, with loss of the short vowel in the penultimate 
syllable, many Latin proparoxytones became Romance paroxytones. 
Similarly, apocope of the final syllable results in a Romance oxytone: 

(29) sanitAtem ‘health’: Ital. sanitd, French sante. 

In Italian, in particular, the short vowel that should have been lost by syncope 
seems still to be retained in the modem standard, so that proparoxytones, as we 
have seen, are quite frequent: 

(30) HfiofiRA ‘ivy’: Ital edera t French lierre (Old iere). Span, hiedra; litt6ra ‘let¬ 
ters’: Ital. lettera I French lettre. Span, letra. 

Whether this is an effect of influence from Latin, or of anaptyxis of vowels to 
avoid awkward clusters, is not certain (see 7.2). 

The stress pattern in most of the languages is not now predictable from 
the syllable structure of the word and in the orthography the position of 
stress may be marked (consistently in Spanish, less so in Italian). Position 
of the stress may differentiate inflectional forms, as in: 

(31) Span. hablolhabl6,lta\.parlolparld ‘I speak/he spoke’. 

In French, where unstressed syllables were reduced and lost by the modem 
period, the stress falls regularly on the final syllable of the isolated word. 
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Occasionally a shift of stress can be used for emphasis {accent d’insistance), 
and some public speakers affect a diction which favours stress on the first sylla¬ 
ble of the word. However, modern French does not normally use word stress 
but rhythmic group stress, with only the last syllable of the group receiving 
stress. The modern habit seems to have become established in the early modern 
period. Evidence from the sound changes suggests that Old French had a heavy 
word stress but that by the sixteenth century words were no longer distin¬ 
guished within the rhythmic group (see 7.2). 


(iii) Palatals 

As an example of the way the phonology of Romance languages differs 
collectively from Latin, let us consider the phenomenon of palatalization. 
What seems to have happened to differing degrees in each of the Romance 
languages is the movement of back and front consonants towards the cen¬ 
tral palatal position, thus creating new phonological distinctions and a 
series of palatal consonants. The processes involved are usually the well- 
known ones of assimilation, by which features of one segment are trans¬ 
mitted to a juxtaposed one. Similar changes are widely attested in other 
languages. 

In Romance, the picture that is usually drawn is of a rich array of 
palatal and palatalized consonants at some unspecified medieval period, 
with some of these merging or moving forward to dental position in more 
modern periods. It is difficult to decide for sure how far these postulated 
palatal consonants were, at any particular stage in the past, distinct 
phonological units, rather than merely phonetic variants of other units. 

Other interlocking sound changes have to be taken into account, so that, for 
instance, palatalization of I^atin c Ik! has to be related to the simplification of 
QU (/k"/ or /kw/): 

(32) *CINQUE [k-] (dissimilated from quinque) ‘five’. > French cinq [s], Italian 

cinque [fl, Span, cinco [0] / quindecim [k”] ‘fifteen’ > French quinze. Span. 
quince [k], but Ital. quindici [kw] / qui/quem [k"] ‘who/whom’ > French qui [k], 
Ital. chi [k], Span, quien [k] / Rum. cine 

Shifts in the lexical stock of the languages probably had an effect on the 
phonological status of the consonants, and perhaps led to some simplifi¬ 
cation of the variation range. This happened especially when frankly 
Latinized forms were borrowed, particularly in the late Middle Ages and 
the Renaissance period. 
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Latin consonantal i 

In Classical Latin there were no phonological palatal consonants. The 
vowel I, however, had as a phonetic variant, palatal glide [j]. There were no 
minimal pairs that distinguished the vocalic and the consonantal variants, 
but, because prevocalic i in word-initial position is often cognate with an 
initial jod in other Indo-European languages, it is assumed that in Latin 
too it was consonantal. However, in the literary language, word-initial pre¬ 
vocalic I sometimes counted as a syllable, while postconsonantally it often 
‘hardened’ to a non-syllabic (jodization). 

In Romance, Latin word-initial I gives in nearly all languages the same 
outcome as word-initial G-t-® ' and as [dj], which suggests hardening: 

(33) lAM ‘already’ > Ital. gia [ch], French ja [ 3 ] (as in deja). Span, ya [j]; gelu 
‘frost’> Ital. gelo, French gel. Span. hielo\ diurnu ‘day’ > Ital. giorno, French 
jour I HODlE ‘today’ > Ital. oggi. Span. hoy. 

Those who maintain that colloquial Latin had a stress accent also 
assume that prevocalic I was always pronounced as jod. In intervocalic 
position the testimony of grammarians confirms that i was a geminate [jj]. 
Furthermore it ‘made position’ so that the preceding vowel was counted as 
long for metrical purposes. 

Late Latin texts provide ample evidence that clusters of consonant + 
jod were tending to assimilate to produce an intermediate palatal conso¬ 
nant (for instance, [dj] was written as z, and [kj] and [tj] were frequently 
confused). The almost regular confusion of prevocalic 6 and 1 suggests 
that both had merged as jod. The Appendix Probi, for instance, cites 
forms like: 

(34) VINEA NON VINIA, ALLIUM NON ALLEUM 

heralding the palatalization in all the Romance languages: 

(35) Rum. vie (dialectal vine), Ital. vigna, French vigne. Span, viha, Ptg. vinha, Sard 
bindza ‘vine’; Rum. aiu, Ital. aglio, French ail. Span, qjo, Ptg. alho ‘garlic’. 


Latin consonantal clusters 

Other Late Latin combinations which were ripe for palatalization were 
those composed of a velar consonant and a front consonant - like L 
(including clusters resulting from syncope of the intertonic short vowel, as 
in the diminutive suffix -CLU) or T, as in, for instance, factum. In the for- 
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mer case, as we have seen, the cluster palatalizes everywhere, whereas in 
the latter there has been resistance to palatalization in some languages (see 
example ( 11 ) above). 

Latin GN poses a particular problem, as it is not clear whether this indeed 
represented a cluster, a geminate or a single segment. In Western Romance 
languages it palatalized to [ji]: 

(36) AGNELLU ‘lamb’ > Ital. agnello, French agneau. Cat. anyell, lignu ‘wood’ > 

Ital. legno. Old French lein. Span, leno, Ptg. lenho (but Rum. iniel (dialectal / 
nel. Southern Ital. dial. Hunu, Sard linnu). 

Most Romance languages retain a palatal n [ji], and a palatal / [X] (in 
French it simplified to jod in the eighteenth century, and in modern 
Spanish varieties it has been transmogrified to [j] or [ 3 ]). 

In Spanish and Catalan a geminate n or l also resulted in palatal [ji] and [A]: 

(37) ANNU ‘year’> aiio, any, castellu ‘castle’ > castillo, castell. 

Catalan also palatalizes l in initial position to [A]: 

(38) llengua I Span, lengua < LINGUA. 

In Spanish, word-initial clusters composed of a consonant +1 palatalize to [A]: 

(39) CLAMARE ‘to cair> llamar, plorare ‘to deplore’ > llorar, ‘to weep’ / Cat. cla- 
mar, ptorar, Ptg. chamar, chorar, Ital. chiamare. Old piorare (modern piangere 
< plangere). 

In Italian, on the other hand, the geminates are retained and [A] derives only 
from medial gl or [Ij] clusters; 

(40) vig(i)lare > vegliare ‘to watch at night’ (but cf. Old vegghiare), melius ‘better’ 
> meglio. 

Latin velar consonants 

A striking feature of Romance is the almost universal palatalization of 
Latin velar consonants when followed by front vowels - a fact that causes 
confusion in school pronunciation of Latin, with Caesar pronounced e.g. 
[sezar] or [tfesar], rather than the [kajsar], which is vouchsafed by the 
German loanword, Kaiser. It is usually assumed that in Proto-Romance 
assimilation of back consonants to a following front vowel must have 
already begun, but whether this merely resulted in phonetic variants or in 
phonological splits is disputed. 
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The Late Latin texts provide very little help in dating the changes. 
Certainly in Sard the palatalization did not take place: this is usually 
linked to the early dissociation of the island from the rest of the Latin¬ 
speaking world. There is evidence of a similar, but inconsistent, resistance 
to this type of palatalization in Dalmatian: 

(41) Veglia toina < cEna‘ dinner’. 

To take as an example the treatment of word-initial /k/ + ‘, nearly every¬ 
where in Romance it acts like medial /kj/: 

(42) civiTATE ‘city’ > [tf] Ital. cittcf, [s] French cite, Ptg. cidade, Cat. ciutat; [9] Span. 
Ciudad / fac\aIf\cies ‘face’ > lta.\. faccia, French face, Ptg. face. Cat. fas. 

Span, haz (but Rum. cetate [tf] / fa0 [ts]). 

This suggests a palatal glide transition between consonant and vowel, or 
at any rate transfer of palatal feature from the vowel to the consonant. 

Despite such similarities, Romance languages (leaving aside Sard) seem 
to split into two groups. The palatalized [k] can end up as a dental affricate 
(and then usually a sibilant), eventually losing palatal features: 

(43) CENTU ‘hundred’> French cent [s], Spanish ciento [0] or [s] 

where earlier evidence reveals that the initial consonant was an affricate 
[ts]. Or it can turn into a palatal plosive or palatal-alveolar affricate (some¬ 
times simplifying to a fricative): 

(44) Ital. cento, Dalmatian (Veglia) cant, R-R (Engadine) caint [tf] (cf. Sard kentu). 

We assume that the former had an apical and the latter a laminal articu¬ 
lation. Whether one of these outcomes can be traced as having passed 
through the other is hotly disputed. The dental (apical) outcome is less 
widespread in Romance than the palatal-alveolar (laminal). In France, for 
instance, both Northern and Occitan dialects prefer the latter. 

The shift to dental articulation can be seen as the culmination of the 
fronting process, while a palatal or palatal-alveolar sound can be seen as a 
half-way house, resulting from a longer resistance to palatalization in the 
more conservative areas. The distinction between Rumanian reflexes 
exemplifled in (42) above may support this hypothesis, in that the initial 
c+®’' seems to have been more resistant than medial [kjj. 
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Inconsistent developments 

Similar inconsistencies, especially with Latin voiced consonants, are found 
elsewhere. This is particularly so in Italy, where different reflexes of the 
same word are often attributed to different regional developments: 

(45) MEDiu ‘half’ > mezzotmeggio, *gregiu from grege ‘herd, flock’ > 
grezzo/greggio ‘untreated cloth’. 

Latin medial [dj], like [gj], had almost merged with [jj] in Late Latin, so 
that we expect to find parallels like: 

(46) MAIU ‘May’/MEDIU: Rum. maiulmiez, Sard maiulmeiii, French mai/mi, Ptg. 
maiohneio. 

Italian maggio and meggio, with [dcfe], thus appear anomalous, with the 
palatalized hardening that is found in strengthened word-initial position - 
see example (33) above. The [ddz] of mezzo similarly recalls dental harden¬ 
ing of postconsonantal [dj] as in: 

(47) HORDEU ‘barley’ > Rum. orz, Dalmatian (Vegliot) vuarz, Ital. orzo, Sard 
ordzu, R-R (Engadine) uenti. Cat. ordi (but see French orge with [t^] > [ 3 ]). 

Hardening, either towards the dental or the palatal, can occur also with [nj]: 

(48) LINEU ‘linen’ > Sard lindza [dz], French tinge. Old [(^] (cf. linea ‘line’> French 
ligne. Span, lifia, with [p]). 

The ‘dental hardened’ reflex can have the same outcome as [ndjj: 

(49) VERECUNDIA ‘shame’ > Sard birgondza. Span, vergiienza (but Old Span, ver- 
gueiia, Ital. vergogna. French vergogne). 

French anomalous developments in, for instance: 

(50) .lage < *SAPius for sapiens ‘wise’ / sache < sapiat ‘that he know’ 

may likewise result from differential resistance to voicing of voiceless conso¬ 
nants in clusters threatened by palatalization. 

If we turn to consonantal clusters that are prone to palatalization we can 
also attribute differential development to greater or less resistance to the 
process. The -ct- cluster palatalizes completely in the Western languages, 
resulting either as [tf] or as [jt] with the glide usually absorbed into the pre¬ 
ceding vowel: 

(51) FACTU ‘fact, doing’ > Span, heclio, Occ. fach / French fait, Vxg.feito, 
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In more conservative languages, however, the velar completely disappears, 
or is transmogrified into a labial: 

(52) ltal.fatto,Seiidfaltu/R\im.fapt. 

Thus the Romance languages have diversified in their more-or-less con¬ 
servative reaction to the palatalizing processes they all share, and which dif¬ 
ferentiate them from Latin. There seems to have been some resistance in 
nearly all languages to the ‘Proto-’ or pan-Romance assimilatory palatal¬ 
ization that ultimately could result in the complete disappearance of func¬ 
tional distinctions. Later palatalizing processes will be discussed in 6.3. 

(c) Grammar: nominals 

Whereas Latin inflections have survived to a considerable extent in the 
Romance verb, Romance nominal organization has changed more radical¬ 
ly. Coseriu (see 1.1), in characterizing the Romance prototype, has framed 
this in terms of ‘internal’ and ‘external’ relationships. Anything more rele¬ 
vant to the (internal or psychological) semantic import of an item - as 
with tense-aspect-mood modifications of the reporting of an event - will 
tend to be expressed ‘internally’, or morphologically, whereas relation¬ 
ships in the real (external) world, like between participants in an event, 
will be expressed ‘externally’, or syntactically. On the whole, the Romance 
languages use ‘internal’ (morphological) marks for the verb system and 
‘external’ (syntactic) marks for the nominal system. In this section, we 
shall look at the latter, comparing Romance with Latin. 

(i) Cases 

The Latin nominal case system marked morphologically the grammatical 
relationship of nouns to the verb, or to other nouns, within the clause. The 
term case is based on the metaphor of ‘falling’ away from the ‘straight’ 
form - the nominative, marking the subject argument of the verb, and seen 
as the unmarked citation form of the noun. The term inflection ‘bending’ 
partakes of a similar metaphor. The non-nominative forms are, in the 
same spirit, usually termed oblique. Although even in Latin grammatical 
information conveyed by case inflections could be supplemented by other 
devices, like use of prepositions, in the prestige language the use of bare 
case forms was regarded as more elegant and economical. 
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Suetonius reports that the Emperor Augustus preferred less ambiguous preposi¬ 
tional uses. In later Latin texts the use of prepositions became more frequent. 

The inflectional case system has virtually disappeared from modern 
Romance and has been replaced by word-ordering conventions (as with 
subject-object relationships) or by use of prepositions (as with dative, 
genitive or locative relationships). 

Morphological case has remained more intact in the pronominal sys¬ 
tem. From a functional point of view this is explicable if we recognize that 
the semantic (or theta) role of a lexical noun is often fairly predictable 
from its inherent semantic features: for instance, an animate is more likely 
to be an agent or an experience^ and so on. Pronouns, on the other hand, 
convey only grammatical, not lexical, information, and thus may need to 
be more clearly marked for such information. 

The Latin nominal system marked five cases (leaving aside the vocative, 
used in forms of address, and the locative, of which there were only relics) 
and two numbers. Syncretism of forms meant that a maximum of eight of 
the possible ten forms were distinctively marked. In Romance the singular 
is usually distinguished from the plural, but remnants of the case distinc¬ 
tions are few. 


(ii) Relics of the Latin case system 

Old French and Old Occitan had a nominal case system which seems 
clearly derived largely from the Latin, but which distinguished morpho¬ 
logically only two cases, nominative and oblique, almost exclusively in the 
masculine. Here are some examples of forms. 


Nominal case in Old Occitan and Old French 



Singular 


Plural 


nominative 

murs 

MURUS 

mur 

MURI 


paire! pedre(s) 

PATER 

paire!pedre 

PATRES 


senher 1 sire 

SENIOR 

senhor i seignor 

SENIORES 

oblique 

mur 

MURUM 

murs 

MUROS 


paire! pedre 

PATREM 

paireslpedres 

PATRES 


senhor 1 seignor 

SENIOREM 

senhors iseignors 

SENIORES 


A non-etymological -s marking the nominative singular extended in 
French to other masculine nouns (like charbons < CARBO), and even to 
feminines (like maisons < mansio). This was fairly frequent in Eastern 
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texts in the late thirteenth century. By this period, however, word-final [s] 
had been lost in some varieties, and hypercorrection may have played its 
part in the graphical spread of -s. The vitality of the imparisyllabic stem- 
alternating pattern is evidenced by its use in Germanic borrowings (like 
berlbaron), but it is almost wholly confined to human reference. 

The case marking seems to have had little functional value, as nearly 
always in texts it is possible to deduce the thematic role of nouns from 
other factors, like word order or lexical constraints. Starting with the 
Western varieties, French had lost its nominal case system by the fourteenth 
century, usually via the disappearance of the nominative forms, so that the 
-s came to mark plurality, rather than case. In Occitan, too, prose texts in 
particular show signs of disintegration of the case system from quite an 
early period, but it survived rather longer than in French, persisting 
through the fourteenth century. 

In some Alpine Romance varieties there remain traces today of Latin 
nominal case marking. Apart from isolated lexical items, relics of case are 
found in predicative masculine singular adjectives which usually inflect 
like plurals. This is most evident in Surselvan Rhaeto-Romance, for which 
there is some evidence in the earliest (seventeenth century) texts of a sys¬ 
tem illustrated below: 

Old Surselvan adjectival inflection 



Singular 


Plural 


nominative 

sauns 

SANUS 

sauni 

SANI 


mes 

MEUS 

mei 

MEI 


clamaus 

CLAMATUS 

clamai 

CLAMATI 

oblique 

saun 

SANUM 

sauns 

SANOS 


miu 

MEUM 

mes 

MEOS 


clamau 

CLAMATUM 

clamaus 

CLAMATOS 


The modern Surselvan masculine plural forms reflect the old oblique for 
adjectives (e.g. sauns ‘healthy’) and the old nominative for participles (e.g. 
clamai ‘called’). This distinction is not retained in other Rhaeto-Romance 
varieties. The varieties in Italy generalize, like Italian, what may be a nom¬ 
inative plural form (e.g. debit = Ital. deboli ‘weak’), whereas those in 
Switzerland generalize the oblique plural forms. In the masculine singular 
Surselvan however distinguishes predicative (i.e. post-copular) adjectives 
from adnominal (‘epithet’ or ‘attributive’) forms: 
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(53) Quei gat ei ahs I in gat alv ‘That cat is white’ (albus) / ‘a white cat’ (ALBUM); II 
cavagl ei vegls / in cavagl vegl ‘The horse is old’ (vetulus) / ‘an old horse’ 
(VETULUM). 

For some common adjectives the Latin origin of the distinction is not so 
clear, as the adnominal, but not the predicative, form shows umlauting, 
suggesting a distinction between etymological forms with a high (u) versus 
a mid (o) final vowel: 

(54) bienibuns ‘good’ (bonu/bonos?), gries/gross ‘big’ (GROSSU), niev/novs ‘new’ 
(novu), miartbnorz ‘dead’ (mortuu). 

These forms suggest that it is the oblique plural, and not the nominative 
singular, form that has invaded the territory of the singular predicative 
adjective. Some traces of this distinction are found in other Rhaeto- 
Romance varieties, especially with possessives, though usually a distinc¬ 
tion is retained between singular and plural forms. 

It has been suggested that the Surselvan distinction between ‘inflected’ 
predicative and ‘uninflected’ adnominal adjectives owes much to the 
influence of German. However, similar distinctions are found in modern 
Alpine Francoprovenqal varieties, which have not been in close contact 
with German. The best evidence we have for this was collected in the mid- 
1950s in the Savoy Maurienne valley which leads to the Modane tunnel 
into Italy, but for other Alpine valleys there are similar data. In general a 
distinction is made between an unmarked masculine singular form used 
not only as an adnominal but also with non-count nouns and imperson- 
als, and the marked predicative form which usually is identical with the 
plural: 

(55) Saint-Martin-la Porte (Maurienne) D amopa lo lae frejt / / tro frejt I Lo lae 
frejt e pas bo / Lo lae: e frej / Lo pla so frej ‘I don’t like cold milk (lacte 
FRIGIDUM?) / ‘It’s too cold’ / ‘Cold milk isn’t good’ / ‘The milk is cold’ (lac est 
FRIGIDUS?) / ‘The dishes are cold’ (frigidos?). 

Regular patterning in this variety clearly points to the predicative adjecti¬ 
val form being identical with the plural. 

The z [ts] which appears on the nominative count noun laez in example (55) is 
not, it seems, a case marker but a word-final consonant that is regularly elided 
in preconsonantal position: 


(56) 


u bez anez ‘a fine lamb’ / u be voez ‘a fine calf’. 
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However, in neighbouring dialects there is evidence of residual retention of a 
noun case distinction. For instance, older informants in reciting fables might 
distinguish: 

(57) nom. korbjo, rejno I oblique korbe, rejnot ‘crow, fox’ (French corbeau < corbels, 

renard < renart). 

It has been suggested that the present-day Surselvan distinction can best 
be regarded as one of gender rather than of case, with the predicative 
adjective marking three genders (m./f/n.), rather than two. In 1.3 we 
argued that parallel distinctions in Asturian and Abruzzo might better be 
treated as of case, and that Romance is characterized by a two-gender 
(m./f.) system. The Alpine evidence clearly shows that the marking origi¬ 
nates in case distinctions which have survived residually. 

How far they are functional today is hard to judge: perhaps here we have an 
example of exaptation (see 7.5). 


(iii) Nominative or oblique? 

Received wisdom has it that the Romance noun is normally derived from 
the Latin accusative form, the singular of which is usually cited as the 
etymon. It is usually assumed that the accusative form was unmarked in 
Latin, especially in the later period. The -M accusative mark is known not 
to have been ‘fully’ pronounced even in prestigious Golden Age usage: 
final -M did not count as separating syllables in metrics and it was fre¬ 
quently omitted in graphy in both early and late texts. It is assumed that 
it was, at most, a graphical representation for nasalization of the preced¬ 
ing vowel, and that the nasal quality was lost in the Romance languages. 

In some monosyllables it hardened again to a nasal consonant; 

(58) REM, MEUM > French rien, mon (with subsequent vowel nasalization); quem > 

Span, quien. 

In the first (a-) declension, there is no way of knowing whether the ances¬ 
tral form is the nominative (in -A), the accusative (in -am) or, because of the 
regular loss of vowel length with this vowel, the ablative (in -A). In the sec¬ 
ond declension, ‘Western Romance’ merger of Cr and o means that we can¬ 
not tell whether the ancestral form is the accusative or the dative-ablative. 
Umlauting dialects often however seem to derive their masculine singular 
forms from an u form (nominative or accusative) rather than an o form: 
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(59) Span. dial. (Asturias) guetu ‘cat’ / gatos ‘cats’; Ital. dial. (Lucania) 
mwertulmorta ‘dead’ (m./f.). 

Note that in Southern Italian dialects the plural marker (< I) triggers umlauting. 

As we have seen, the count/mass distinction of Asturian and South- 
Central Italian varieties may stem originally from a case distinction, with 
count forms from the nominative-accusative in 0, and mass forms from 
the dative-ablative in o. The differential development of these two final 
vowels is shown in: 

(60) Asturias ibiernu ‘winter’ < hibernu / como ‘how’< quOmodS; Central Ital. 
dial. (Trevi, Umbria) corpu ‘body’ / oto ‘eight’ < ocro. 

The best evidence that Romance nouns do not normally derive from the 
nominative Latin forms is provided by the third declension, where very 
frequently, mainly as a result of sound changes that occurred in Latin, the 
nominative singular stem is shorter than that of the rest of the paradigm. 
Nearly always it is the longer (oblique) form that seems to survive into 
Romance (see e.g. (52) in chapter 1 for lacte, ponte etc.). Other frequent 
pan-Romance words include: 

(61) PES, PEDIS ‘foot’ > Ital. piede. French pied\ flos, floris ‘flower’ > Rum.^oare, 

Ital.^ore, Sard flare, French Occ., Cat. flor (Span, flor, Vtg.frol are prob¬ 

ably loanwords). 

However, it is not always possible to tell from the Romance forms which 
the ancestral Latin form was (see e.g. Span, pie, Ptg. pe ‘foot’). Moreover, 
it seems certain that some frequent words, in some languages at any rate, 
derive from the Latin nominative: these are often nouns denoting human 
beings: 

(62) SOROR, SORORIS ‘slstcr’ > Rum. sor(a) Ital. suora ‘nun’, French sceur, PRESBYTER, 
PRESBYTERUM ‘priest’ > Rum. preot, Ital. prete, French pretre. Cat., Span., Ptg. 
preste. 

Sometimes both nominative and oblique forms are found: 

(63) HOMO ‘man’ > Rum. am, Ital. uomo, French on / hominis > French homme. 
Span, hombre, Ptg. homem. 

In many North Italian varieties many forms appear to be derived from the 
Latin nominative, suggesting that the case system survived longer here than 
further south. 
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(iv) Differential object marking 

Some Romance languages differentially distinguish the object of the verb 
from the subject by a special particle usually derived from a Latin preposi¬ 
tion. The prepositional accusative (PA) is best known in modem Spanish, 
where it is also called the ‘personal accusative’ because it is used normally 
with human objects, which are usually particularized. It was found spo¬ 
radically in Old Spanish but it became grammaticalized at the time of 
standardization of the language in the sixteenth and seventeenth cen¬ 
turies. It seems to be used in all varieties of Spanish, though there is some 
dialectal variation in its scope. 

The marker is derived from directional ad (‘to’), which is also used as a 
dative marker. Traditional Romance philology sees it as an extension, dur¬ 
ing the medieval period, of dative marking. This is also generally found 
more often with animates, which figure more frequently as ‘recipients’. In 
some varieties the dative is distinguished from the PA by a differentiated 
pronoun copy (dative le I accusative lo, la-, see 4.3): 

(64) Lo vimos a Juan ‘We see [him to] John’ / Le damos a Juan un libro ‘We give [to- 
him to] John a book’ 

but leismo (see 6.5) varieties do not make the distinction. 

It has been suggested that the a is required to assign case to the noun object in 
sentences where the verb has already assigned structural case to a clitic object. 
It is, however, obligatory with personal names, for instance, even where there is 
no clitic copy: 

(65) No conozco a Pepe ‘I don’t know Joe’. 

A plausible explanation for the development of PA is that it picks out, as 
an object of the verb, a noun which might otherwise be understood, by 
virtue of its animacy and definiteness, as an agent and topic, in a language 
where constituent order does not unambiguously signal grammatical rela¬ 
tions. Potential ambiguity was given as a reason for prescribing the use of 
PA in early Spanish grammars. 

The occurrence of PA earlier, and more frequently, with pronouns, 
which are morphologically or lexically marked for case, however, suggests 
that the disambiguation argument does not hold water. Besides, PA is par¬ 
ticularly frequent in dislocated topical expressions, usually echoed by a 
clitic pronoun. These circumstances lead some investigators to postulate 
that PA was originally a stylistic device, used to highlight an argument of 
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the verb. Its use has also been linked to increased concern with the individ¬ 
ual in the early Christian era, and it is seen as a proto-Romance innova¬ 
tion, consequent on the decay of the inflectional case system, designed 
clearly to differentiate the sentence roles of human agent and patient. 

Certainly parallel uses are found in other Romance languages, though 
not as consistently as in modern Spanish. In Portuguese, after an upsurge 
of popularity of the construction during the seventeenth century, it is lim¬ 
ited in its distribution. It may be that the reaction against Spanish influ¬ 
ence during the period of standardization of Portuguese discouraged its 
propagation in prestigious usage. Galician has not participated in the 
Portuguese reaction and makes full use of PA. 

In Catalan, too, where regular use of PA is frowned on, as a Castili- 
anism, it is frequent, not only in ‘street language’, but also, for instance, in 
more traditional use in Majorca. In Western Oceitan, especially in 
Bearnese, the construction is particularly frequent in popular speech (and 
is transferred to regional varieties of French). It is also a feature of Sard 
and of Southern and Central Italian dialects, and of Engadine Rhaeto- 
Romance. 

In some Rhaeto-Romance dialects the agglutination of ad to the third-person 

clitic pronoun til etc., may explain its unusual form. 

In all these areas some reflex of ad is used as the accusative marker: in 
Catalan a prevocalic form an suggests a conjunction of in -i- ad, found 
also in Gascon ena, whereas in North Italian immigrant dialect islands in 
Sicily we find da, suggesting a combination de -i- ad. Other prepositions 
can, however, perform the same function. In Rumanian pe (< per 
‘through’) is fully grammaticalized in much the same way as in Spanish; 
sixteenth-century texts also used spre (< super ‘above’). As Rumanian 
varieties south of the Danube do not regularly use PA, it is usually 
assumed that it was a comparatively late development in literary Daco- 
Rumanian. In Old Portuguese, and today in some creoles, para could be 
used to mark the object, and today Kristang (see 2.2) uses ku from con. 

Everywhere the marker is used preferentially with tonie personal pro¬ 
nouns (often with clitic doubling - see 4.4), with personal proper names and 
with singular nouns designating speeific human beings. It is less often used 
with plurals or collective nouns and in some varieties (e.g. Sard, Corsican, 
Bearnese, Rumanian) it seems almost mutually exclusive with the definite 
article. In Sard, use of PA (with a) is obligatory with all definite accusative 
NPs without a determiner: 
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(66) Connosco a Juanna I a issa ‘I know Joan/her’; Istimo a frate men ‘I love my 
brother’; but Amus visto (*a) mu pasture I sos sordatos ‘We’ve seen a shepherd / 
the soldiers’. 

The fact that PA is not found in French (except in some peripheral nonstandard 
varieties) is seen as linked to the medieval retention of an inflectional nominative- 
oblique case-system (which was mainly operative for nouns denoting human 
males). It is claimed that the Occitan case system was probably of Eastern dialec¬ 
tal origin, while the Western dialects favoured PA. Particularly striking is the con¬ 
trast between neighbouring Rhaeto-Romance dialects, with Surselvan having 
retained an inflectional case system, and Engadine developing PA. 

(v) Oblique case 

Leaving aside the marking of personal accusatives by a preposition, most 
modern Romance languages have for nominals a direct/oblique system, 
where nominative and accusative (direct cases) are grouped together in 
face of other (oblique) cases which are normally marked by a preposition. 

The Rumanian system is, however, inflectional, with most inflections 
dependent on an agglutinated definite article (or pronoun? - see below) 
differentiated for direct versus oblique (dative-genitive) case. Some 
Rumanian cases, however, do not depend wholly on definite-article inflec¬ 
tion. The vocative in -le m., -o f., and -lor pi.: 

(67) Radide! ‘Radu/’, Ano! (/ And!) ‘Anna!’, mamelor! (/ mame!) ‘mothers!’ 

is thought by some to be influenced by Slavonic. In the feminine too the 
articleless oblique singular in -e is identical with the plural and not with 
the direct singular in -a: 

(68) mama (nom. sg.) / mame (other cases). 

This ending may derive from the Latin genitive in -AE. 

Otherwise traces are rare of the Latin genitive case (which marks relation¬ 
ships between nouns, and seldom between noun and verb). In Latin, 
already, the inherited genitive -s had been replaced in the first and second 
declensions by -T (ai > ae, oi> I) except for a few relics (pater familias 
‘father of the family’), and it has no reflexes in Romance. Some traces of the 
plural flection -5rum (< os-um) are to be found: in Sard possessive issoro (/ 
de issos), and in pronouns and determiners Rum. lor, Ital. loro, French leur. 

In very early Old French isolated nominals (quasi-adjectival) are found: 
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(69) gens Francor ‘people of the Franks, Frankish people’, tens ancienor ‘times of 
the ancients, old times’. 

Everywhere, except Rumanian, the directional preposition dE is used to 
indicate possession; it is also used as a partitive marker in some languages 
(see 6.5). It is rivalled to some extent by ad, which tends to be used with 
personal possessors in many varieties: 

(70) French le livre d moi ‘the book belonging to me’ / ns. le livre d Jean ‘John’s 
book’; Rum. tatda trei copii ‘father of three children’. 

For family relationships flectional genitive marking survived into some 
medieval varieties: 

(71) Old French lifils (nom.) lerei (oblique) ‘the king’s son’. 

Everywhere except Rumanian uses the preposition ad to mark dative 
(recipient) function, which is not consistently distinguished from location¬ 
al goal. In some languages, however, there is some tendency to distinguish, 
by different pronominal reference, true datives (as in dormer a. ‘give to’) 
from lexically determined dative use (which can often be linked to PA), as 
in obeir d ‘obey (to)’ and directional movement, as in aller d ‘go to’. 

The agent in passive constructions is marked by a preposition derived from 
PER (French par. Span, par) or DE (+ ab) (Ital. da). 


(vi) Pronominal case 

As elsewhere, Romance pronouns are more apt to distinguish case than 
are nouns. We have seen that discourse pronouns continue on the whole 
the differentiated singular nominative and accusative case forms of Latin, 
and that there is some tendency to create new distinctions in the plural. 

However, disjunctive forms for persons 3 and 6 usually do tend to distin¬ 
guish nominative and accusative. Some originally nominative forms like 
Span, el, ella, Ptg. ele, ela and Rum. el, ea, ei, ele are used both as subject 
and as object of a preposition (although in Rumanian, directional prepo¬ 
sitions often govern oblique case). In Italian the nominative forms: 

(72) egli m. sg. < ILLI; egUno pi. (with plural verb inflection?), esso etc. < IPSU etc., 

and ello etc. < ILLU etc. (the preferred Tuscan forms) 

have largely given way to lui, lei, which were originally dative (see also 
French lui (< dative) / elle, eux, elks (< acc.)). 
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Dative case is marked on Rumanian disjunctive person 3 pronouns (lui, 
ei, lor) and on clitic person 3 pronouns nearly everywhere. The clitic dative 
singular forms, which are not marked for gender, derive from Latin distal 
demonstrative m. sg. dat. illI: 

(73) Ital. gli. Old French li 

or a refashioned form *illiae (with addition of a feminine dative ending?): 

(74) Span, le (< [Ije]), Ptg. Ihe 

or *iLLUi (on the model of cut ‘to whom’): 

(75) French lui. 

Plurals derive from illIs (dat. pi.): 

(76) Sard Us, Span, les, Ptg. Ihes 
or from illorum (gen. pi.): 

(77) Ital. loro, Occ. lor, French leur. 

Few languages, however, distinguish the dative from the accusative in 
non-third-person clitic pronouns. Rumanian does so consistently, in the 
singular: 

(78) imilmd; Uilte 

and Galieian distinguishes from the person 2 accusative te, a dative che 
(often used as an ethic dative or ‘solidarity pronoun’): 

(79) O vecino trouxoche estas cereixas ‘The neighbour brought you these cherries’. 
Note that che is not used as a reflexive: 

(80) Comprouche un sombreiro ‘He bought you a hat’ / Compracheste un sombreiro 
‘You bought yourself a hat’. 

Far Western varieties of Galician use only che and far Eastern varieties only te 
as the person 2 clitic, in all contexts. 

Those Swiss Rhaeto-Romanee varieties which today use only disjunc¬ 
tive pronouns also made a distinction between person 2 dative and accu¬ 
sative clitics in old texts. 
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(vii) The definite article 

Loss of the Latin case system has often been linked in the history of 
Romance with another salient occurrence: the growth of the definite arti¬ 
cle. Usually the definite article is seen as a device for inserting the bare 
noun into the sentence, just as case did in Latin. Whereas the Latin noun 
had no phonological shape not associated with sentence role, in Romance 
the morphological noun conveys only lexical content (and presence or 
lack of plurality). Syntagmatic strategies are used to code grammatical 
relationships. 

The definite article seems to have been used initially less as a specifier, 
picking out the individuality of the object designated by the noun, than as 
a topicalizer, anaphorically marking a noun that is accessible to the hearer. 
As in other languages, the function of the definite article widened as time 
went on, as it was used with nouns denoting unique objeets and with 
abstracts, and to show inalienable possession, through a generic use to 
denote type rather than an individual, to become almost merely a gender 
and number marker for the noun. It is as fulfilling the last function that tra¬ 
ditionally Romance grammarians have characterized the definite article. 

In modern Romance the definite article is used with very high frequen¬ 
cy: in French, in particular, it is one of a series of virtually obligatory 
determiners that indicate nouniness and mark gender and number. In cre¬ 
oles, on the other hand, the Romance definite article survives sporadically 
only as an integral part of the noun. Other means (for instance, a location¬ 
al adverbial) are used to mark specificity or noun phrase status. 

The use of the definite article has never failed to strike observers as an 
outstanding difference between Latin and Romance. Latin grammarians 
frequently remarked on the absence of an equivalent of the Greek article 
in their own language. It has often been suggested that the artiele arose in 
Christian Latin as a transfer from Greek; however, evidence from early 
Latin translations of the Greek New Testament shows no consistent rep¬ 
resentation of the Greek article. 

As elsewhere, the initially anaphoric article usually derived from a deic¬ 
tic (demonstrative) pronoun, which indicated the locational position of the 
targeted noun. In most Romance languages the demonstrative which gave 
rise to the article was the distal pronoun ille, which also forms the base of 
the distal demonstrative (in all the Romance languages), reinforced by a 
preposed ecce ‘behold!’: 
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(81) Ital. quello. Rum. acel. Span, aquel, Ptg. aqiiele. Old French cil, Sard kuddu. 

Ille also survives in all the languages as a person 3 pronoun. One ques¬ 
tion that we should pose is how the pronoun relates to the article, in form 
and function; for the phonological relationship between determiners and 
their cognate pronouns see 7.5. 

One difference between pronoun and article is that normally the latter 
does not inflect for case. This does not hold for Rumanian or for Old 
French. In Rumanian the enclitic article is clearly related to the inflected 
disjunctive pronoun: 

Rumanian nominal inflection 

(‘mother(s)7‘son(s)') 

Singular 
nominative-accusative 

f. mama (mamd+ ed) 
m.flul (flu + el) 

genitive-dative 

f. mamei (mamd+ ei) 
m.fluluiiflu + lui) 


Plural 

mamete (mame + ele) 
flii tfii + el) 

mamelor (mame + lor) 
flilor ifli + lor) 


In Old French the masculine nominative article, unmarked for number, 
is etymologically related to, but not identical with, the corresponding pro¬ 
noun: 


(82) ILLI > article li, pronoun il 

whereas the modern surviving article forms are the same as the clitic 
object pronouns: 

(83) le, la, les. 

The only way these latter differ from each other today is that articles, but 
not pronouns, fuse with preceding prepositions: 

(84) du < de le, des < de les. 

This distinction was unknown before the fourteenth century. 

In some Swiss Rhaeto-Romance dialects a dative definite article - identical 
with the dative person 3 pronoun - is still in use: 

(Surmeiran) Zhi kwit li mama ‘Say this to [the] mother’. 

In Alpine Francoprovenpal an accusative definite article is also distinguished: 


( 85 ) 
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(86) (Valais) [i fwa] ‘the fire’(m.) / [am prejo o fwa] ‘I light the fire’; [i ^la] ‘the 
key’(f.) / [tijio a tfla] ‘I hold the key’. 

In most of the languages (but for Sard see below) the article closely resem¬ 
bles the person 3 object clitic pronouns. In Spanish the masculine singular 
is identical (except for a graphical accent) with the subject pronoun el. 
Here, as with Italian il, the difference probably derives from a phonologi- 
cally determined variant that has been standardized, rather than as a case 
distinction (see 7.5). The normal masculine (or in leismo dialects ‘non-ani- 
mate’) object clitic form lo is specialized in Spanish as the ‘neuter article’ 
(see 1.2). 

Another way in which the person 3 pronouns differ from the articles is 
that the latter normally accompany nouns, whereas the latter are attached 
to verbs. This is, however, not always so: articles can be used with adjec¬ 
tives, where there has been apparent deletion of the noun: 

(87) le livre blanc et le rouge ‘the white book and the red one’. 

In Italian a demonstrative, not an article, is used in this context: 

(88) guello rosso 

while in Rumanian both are used: 

(89) cel ro§ul. 

Moreover, in all of the languages at an earlier period, and still in some, the 
article/pronoun form can be used as a pronominal: 

(90) Span, la que buscas, Ptg. a queprocuras ‘the [woman] you are looking for’; R-R 
(Engadine) Us de Samedan ‘the [people] of Samedan’; Sard (Nuoro) su ki keljo 
‘the [thing] I want’; Old French la sa mere ‘the [soul] of his mother’. 

Thus the differences between article and pronoun are less pronounced 
than usually assumed, and it is no wonder that nearly all early Romance 
grammarians, and some present-day ones, see them as distributional vari¬ 
ants. In particular, it is suggested that they fill the function of agreement 
markers, cliticized to noun or verb. 

(viii) Ipse as an article 

Sard here seems to present a special case. The Sard article derives from the 
‘identifying’ particle ipse ‘himself etc.: 


(91) 


SU, sa, SOS, sas 
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as do the disjunctive third person pronouns: 

(92) issu, issa, issos, issas. 

Clitic object pronouns, on the other hand, continue, as in all other 
Romance languages, ille forms: 

(93) lu, la, los, las. 

Some Catalan varieties also make use of both ipse and ille article 
forms, but only ille pronominal forms. No Romance language uses an 
ipse form as third-person non-reflexive clitic object pronoun; if they did, it 
would be likely to be confused with reflexive se forms. 

There is evidence that IPSE article forms were once more widespread, though 
confined mainly to the ‘central’ Romance area, and popular in character. One 
factor that could have worked against the retention of ipse articles elsewhere 
was their potential homonymy with suus ‘weak’ possessives. In modern Sard 
only ‘strong’, adjectival possessives (see 6.5) are found, but in Catalan varieties 
which use ipse articles, the weak ‘determiner’ possessive, in the feminine singu¬ 
lar form sa at least, can indeed be confused with the article. However, the weak 
possessive is in modern times restricted to use with the nouns denoting close 
kinship: the semantic difference between sa mare in its two possible senses 
(‘the/his mother’) is negligible. 

We may well ask why ipse should have been ousted in most of the Romance 
area. Such textual evidence as we have suggests that by the Late Latin peri¬ 
od it had become the preferred adnominal identifying particle, and was 
ousting IS in its anaphoric function. Ille , which had in classical times been 
less frequent, had become the deictic of choice, as the proximal HIC lost 
ground, but it was still used more often pronominally than adnominally. 

It seems that ipse became more identifled with subject arguments and 
participant-orientated functions, while distal demonstrative ille referred 
more to third-person objects. Texts confirm that usually ipse shows a more 
decided preference for nominative case. As modern Romance lexical 
nouns stem mainly from accusative Latin forms, it would not be surprising 
if they prompted a distal ‘object’ anaphor. 

What is puzzling is that some varieties prefer the ipse forms for the definite 
article. Whether indeed this choice is accompanied by other relevant morpho¬ 
logical or syntactic differences requires further investigation. In Sard we might 
look to such features as the use of the definite article in vocatives, the great fre¬ 
quency of clitic doubling, with left or right dislocation of the object NP - 
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which suggests that the NP may be syntactically detached from the sentence - 
the tendency to non-use of the determiner in predicate nominals; and preposi¬ 
tional marking of object nominals. 


(d) Verbs 

It is in the verbal system that Romance languages are said most clearly to 
reflect their Latin origin, and their close relationship to one another. We 
shall here examine some aspects of verb morphology which illustrate 
these relationships. 

(i) Conjugation system 

Latin regular verbs were organized into four conjugation types according 
to the thematic vowel that linked the root (the lexical element) with the 
inflection. The thematic vowels were A, E, Eand I. Only the A-conjugation 
remained truly productive in all the Romance languages, though the I-con- 
jugation welcomed new words in earlier periods. The two E-conjugations 
have tended to coalesce: reasonably well differentiated three conjugation 
systems persist in Central Italian, Friulian and Bearnese-Aragonese. 
Rumanian is the only language to retain four conjugations (a fifth in i rep¬ 
resents a phonetic variant of the /-conjugation) but some of the forms that 
make the distinction - like the preterite - are not in common use. The most 
usual organization of the verbal system allows for two conjugations - a 
and i most frequently - but French distinguishes them in comparatively 
few forms. The infinitive, used as the citation form, is usually the most dif¬ 
ferentiated; person 5 forms come a close second. Latin E-verbs are among 
the most frequent, and can survive as irregular forms or are incorporated 
into another conjugation: 

(94) dIc£re ‘to say’: French, Ital. dire. Span, decir, fac6re ‘to do’: French faire, 
lidX.fare, Cat.fer. 

Note that esse ‘to be’ seems to have been incorporated into this conjugation in 
Proto-Romance. 

The E-verbs figure prominently among auxiliaries and modals: 

(95) HABERE ‘to have’, debere ‘to have to’; also ’^volere ‘to want’ for velle, 
*P0TERE ‘to be able’ for posse, *sapere ‘to know’ for SAPfiRE. 


All the Romance languages make use of an -sc- infix, originally with 
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inchoative force, in some verbs. In Spanish, Portuguese and Sard it is 
totally lexicalized and occurs through the relevant paradigms: 

(96) Span., Ptg. fenecer < finTre ‘to finish’,patfecer < *patIre (patI) ‘to suffer’; 

Sard albeskere ‘to dawn’ < *albIre ‘to whiten’ (see Rum. albt). 

In Latin some inchoative forms were already lexicalized: 

(97) Nosco ‘I come to know’ / nouT ‘I knew’. 

In Other languages the affix appears only in certain forms: the vast majori¬ 
ty of verbs in the i-conjugation are affected, but the choice of verb is 
unpredictable: 

(98) French finissons ‘we finish’ / finir ‘to finish’, Ital.^nisco ‘I finish’ / finiamo ‘we 
finish’; Rum. pdtescipatim. Cat. pateixipatim ‘I / we suffer’. 

In Swiss Rhaeto-Romance the infix is also used in the a-conjugation: 

(99) Engadine gmtulesch ‘I congratulate’ / gratuler ‘to congratulate’. 

Another affix (< -IDIO, of Greek origin) is used in Rumanian, and in some 
Southern Italian varieties, in about half the verbs of the a-conjugation: 

(100) Rum. lucrezHucreazdHucram ‘I/he/we work(s)’. 

The advantage of these affixes was that they facilitated regularization of 
paradigm stress placement. In the verbs affected the stress fell regularly on 
the syllable following the lexical root, which consequently showed no 
apophony (see 7.3). Compare: 

(101) French meurtimourons, Ital. muorelmoriamo ‘(he) dies / (we) die’ (< ’"morIre 
for morT), alongside French abolitlabolissons, Ital. aboliscelaboliamo ‘(he) 
abolishes / (we) abolish’ (< *ABOLrRE for abolere, abolescere); Rum. 
omoard/omorim ‘(he) kills / (we) kill’, alongside ura^telurim ‘(he) hates / (we) 
hate’ (< odIre). 

Thus, though the lexical material of the Romance conjugation systems 
is nearly all of Latin origin, the overall organization has changed consid¬ 
erably, mainly in the direction of simplification, but also of stem elabora¬ 
tion, of suppletion and of lexicalization. A consequence is that the 
Romance conjugation systems share a good many features not present in 
Latin, some of which may be Proto-Romance. 
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(ii) Infectum and perfectum 

As described by Varro (c. 116-37 BC), Latin verb morphology has as a 
basic feature the difference between the present (INFECTUM ‘unfinished’) 
and perfect (perfectum ‘completed’) stem. The latter normally is an exten¬ 
sion or modification of the former (as in ama-/amau-, cred-/credid- or 
FAC-/FEC-), though in some verbs there is suppletion (as in es-/fu). The 
association of the forms respectively with imperfective or perfective aspect 
holds only to a certain degree; there is no distinct form equivalent to the 
Greek aorist, and its functions are filled by the present, or historic, perfect 
(AMAUi) . The present stem has three finite indicative tenses - present (the 
timeless unmarked form), future and past. These two last both have a -b- 
afhx (amabo /amAbam, videbo/videbam) which probably derived from a 
form of the verb ‘to be’. The fi- and T-conjugations, however, have no -b- in 
the future (regam/regebam, dormiam/dormiebam): the asymmetry may 
be one factor that accounts for the complete disappearance of the Latin 
future from the Romance languages (see 4.5). The other two present-stem 
indicative forms - the present and the past imperfect - survive in all the 
Romance languages. Of the two parallel subjunctive forms the present sur¬ 
vives everywhere; the imperfect subjunctive (amarem) disappears, or per¬ 
haps develops as a personal inflnitive (see 4.3). The imperative forms that 
survive in Romance usually mirror the equivalent indicative forms. 

The perfect-stem forms have a more chequered career. The present per¬ 
fect (amaui) survives with preterite (punctual past) functions, especially in 
Western Iberian, Occitan and Southern Italian varieties, but only margin¬ 
ally in French, Catalan, Rumanian and northern Italy, being lost com¬ 
pletely in Sard and some other varieties (see below). In its present perfect 
functions it is replaced nearly everywhere by compound forms (see below). 
The future perfect (amauero) merges with the perfect subjunctive 
(amauerim) and survives marginally as a future subjunctive (see 3.4). The 
pluperfect subjunctive (amauissem) is used as a past subjunctive form in 
several languages, and in some areas is rivalled in this function by the 
pluperfect indicative (amaueram), which only marginally retains any 
pluperfect function, usurped in most places by a compound form. Some 
languages have kept a greater number of Latin verb-forms than others; 
for discussion of relative conservatism in Romance see 7.7. 

Each of the active Latin forms had a passive equivalent. In the present- 
stem forms all were marked by an -R (possibly originally a nominalizing 
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suffix). All of these were lost in Romance, which favoured the compound 
perfect forms (amatus sum etc., but see 4.7). 

Thus the Romance verb system, although using inherited material, is 
fundamentally different from that of Latin. It is usually assumed that the 
introduction of the new compound perfects and the loss of the Latin 
future and present-stem passives were effected at the Proto-Romance 
stage. Other changes are thought to have happened after the split-up of 
the languages, sometimes in parallel and sometimes as a result of inter¬ 
influence between the languages. 


(iii) The ‘imperfect’ 

The Latin, and hence Romance, name given to the amabam for ms is some¬ 
thing of a mistranslation of the Greek equivalent (paratatikos), which 
implied not unfinished action, but an action on which no time limits are 
placed by the speaker. It survives in every Romance language (and even in 
some creoles) as a past tense, with unmarked semantic characteristics that 
allow for a range of discourse interpretations. Most usually it is seen as 
implying continuous or habitual action in the past, but it also takes on 
almost modal nuances, some of which hint at some distancing of the 
speaker from the action, as in indirect speech or conditionals. It can also be 
used as a conscious stylistic device to give an impressionistic tinge to narra¬ 
tive description. It is a tense that is comparatively little used in colloquial 
styles, and is found more often in subordinate than in main clauses. 

The Romance forms of the imperfect are not, however, straightforward 
reflexes of the Latin forms. The A-conjugation endings from -aba- are 
comparatively unproblematic, but the other conjugations have clearly not 
developed regularly. The i-conjugation in Latin was already out of line, 
with its -lEBA- endings; the -iba- type endings of central Italian varieties 
(and of Aragonese-Bearnese and Friulian) may be survivals of a pre- 
classical type, or reformed to a three-conjugation pattern: 

(102) Ital. cantava, vedeva, dormiva ‘he sang, saw, slept’. 

Old texts support the latter hypothesis, but they may be influenced by 
other B-less varieties - Sicilian, for Italian, and Castilian, for Aragonese. 

What is particularly odd, though, is the irregular loss of the Latin -b- in 
the E- and i-conjugations: a frequent pattern is of two endings -aba- and 
-lA- (found in the Iberian peninsula, southern France and much of Italy): 
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(103) Cat., Ptg., Occ. cantava. Span, cantaba! Occ. vezia. Cat., Span, ve/a, Ptg via / 
Cat., Ptg., Occ., Span, dormia. 

In northern France the one surviving imperfect ending (older -oi-, -ei-, 
modern -ai-) is assumed to derive from -EBA-, with irregular loss of -B- 
(earlier oe-, -eve- from -aba- having fallen out of use): 

(104) chantait, voyait, dortnait. 

The text-book story that the loss of -b- was by analogy with a postulated dis- 
similatory fall of the second -B- in habebat never carried much conviction, 
and indeed originated only in a tentative suggestion. 

A more promising line of enquiry may be to recognize that in early 
Romance intervocalic -B- (which merged with consonantal u [w] as [P]) 
tended to fall, as it did regularly in Rumanian and some other varieties: 

(105) Rum. cintd, video, dormea. 

We can then ask why it should have been retained (or restored) in some 
circumstances. 

There is certainly a correlation between the maintenance of a three- 
conjugation system and -aba-/-eba-/-iba- type endings. There is also some 
indication that the -b- mark may have been reinforced, in some languages, 
as an aspectual marker to distinguish the imperfect from the preterite. 
These are often languages in which the compound perfect coexists, in 
aspectual contrast, with the simple preterite. 

In other cases, however, it seems that the preterite and the imperfect 
have tended to merge, as the temporal aspects of the forms were empha¬ 
sized; in these circumstances the imperfect -b- marker would be more like¬ 
ly to fade away. Signs of merger are found in southern Italy and in older 
varieties in the Iberian peninsula and southern France. 

Modern French may, as elsewhere, represent a special case. Here the for¬ 
mal homogeneity of the tense was reinforced by the analogical spread of a 
single marker through the conjugation system. In the spoken language the 
compound, originally perfective, form is the only other past tense. It is the 
quasi-modal, aspectual, and not the temporal flavour of the imperfect that 
comes to the fore. The ‘fuzzy’ passe-partout character of the French 
imperfect may well be reflected in its morphology, which stresses the 
homogeneity of the tense, rather than the morphological conjugation 
class of the lexical verb. 
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Another feature of the imperfect paradigms is that in persons 4 and 5 accentu¬ 
ation on the ending, as in Latin, is retained in Italian, French, Old Occitan and 
Sard, but elsewhere the accent has moved back on to the theme vowel, which is 
thus accented throughout the paradigm. 


(iv) The compound perfect 

Nearly all the Romance languages make some use of a compound perfect 
consisting of the present tense of an auxiliary verb and an originally pas¬ 
sive participial form (amatus ‘loved’ etc.). In traditional grammar this 
compounded form often has a name that implies a semantic import: in 
Italian passato prossimo ‘near past’, in French parfait ‘perfect’ or Spanish 
preterito perfecto ‘past perfect’, in Portuguese pretMto perfeito indefinido 
‘indefinite past perfect’. But a name which refers to the form - implying the 
compounding of two verbs - is sometimes preferred: perfectul compus, 
passe compose. Everywhere the auxiliary also can appear with tense forms 
other than the present, with functions of, for instance, pluperfect or future 
in the past etc. 

Some apparent ancestral constructions are attested in Latin. The transi¬ 
tion seems to have been something like: 

(106) Habeo litteras scriptas ‘I have (got) letters written’ > Ital. Ho le lettere 

scritte > Ho scritto le lettere ‘I have written the letters’. 

Originally this sentence could imply that the letters that I have had been 
written by someone else. Grammaticalization of the sequence involves a 
reanalysis by which the subject of the finite verb is necessarily assumed to 
be the subject of the participle, which is interpreted as active rather than 
passive. The two verb forms thus amalgamate to the equivalent of a simple 
verb form, and come to be used with intransitive verbs; for the selection of 
a HA VE or BE auxiliary for these see pages 15-24. 

The amalgamation is most complete in modern Spanish, where interca¬ 
lation of elements between the auxiliary and the past participle is very 
rare. Haber, having ousted ser as the auxiliary of unaccusatives, has virtu¬ 
ally lost its original lexical meaning of ‘to have, to possess’, in which sense 
it is replaced by tener ‘to hold’. In many other varieties the retention of the 
BE auxiliary for certain verbs, and the possibility of agreement of the past 
participle with an expressed object, seems to indicate less complete gram¬ 
maticalization (see 6.5). 
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In southern Italy and Western Iberian (including Latin America) the 
compound perfect is little used. In colloquial Portuguese (as in some 
Southern Italian varieties) ter < tenere ‘to hold’ has completely replaced 
earlier haver, as an auxiliary as well as a lexical verb. The Portuguese com¬ 
pound does not in the indicative have the same present perfect functions as 
other Romance forms, but is used to indicate continuing, just completed 
or repeated actions in the recent past: 

(107) Tenho feito muitas veces esse Irabalho ‘I’ve done this work many times’; Tenho 

acabado o meu discurso ‘I’ve got my talk over’; Tenho estado doente ‘I’ve been 
ill (lately)’. 

In the medieval varieties of both Spanish and Portuguese, tenere 
reflexes competed with those of more frequent habere as perfect auxil¬ 
iaries. Although the Spanish tener forms gained ground until the seven¬ 
teenth century, they were not usually found in intransitive constructions, 
and the movement towards their grammaticalization was slowed down at 
the time of standardization of the state language, perhaps because arbiters 
perceived a difference of meaning between the competing forms. In the 
modern standard tener + participle constructions are not completely 
grammaticalized, usually involving participial agreement with the object, 
and they tend to be constrained lexically, appearing to retain some rem¬ 
nants of the semantic content of tener, implying possession or duration. 
In Portuguese, on the other hand, although haver forms were more fre¬ 
quent in the earlier texts, ter forms began to outnumber them by the early 
modern period, triumphing at the time of standardization. More or less 
simultaneously non-agreement with expressed objects became regular. 

The Romance compound form was originally used only to indicate 
actions completed in the present, compared with the narrative simple past 
preterite form, which carries names like ‘remote past’, ‘past definite’ or 
‘past historic’. However, in many modern Romance varieties, the com¬ 
pound forms have replaced the simple past forms in colloquial speech. 
This can be seen as a shunning of the morphological complexities of the 
simple past (see 7.3), as part of a general drift towards analytic construc¬ 
tions, or as connected with the emphasis on the here and now in colloquial 
usage. It is assumed that the process was completed only comparatively 
recently as standard literary languages retain the older distinction. 

In spoken standard Italian and Spanish the semantic distinction between 
the simple and compound forms persists. To some extent this distinction 
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may be regarded as somewhat literary. In those varieties in which the sim¬ 
ple form persists - for example, in Latin America and southern Italy - it is 
much more frequent and unmarked semantically, with the compound form 
tending to be reserved to refer to events in the immediate past. 

In the Spanish of the Andes the past tense of the compound form (pluperfect) 
seems to imply lack of personal commitment to the statement made, rather 
than past timing: 

(108) (Bolivia) y en nada hablan encontrado trabajo ‘and they haven’t found work 
(apparently)’; Habian sabido fumar ‘They do smoke (I gather)’. 

This usage is to be compared with that of the future or conditional perfect in 
some other languages: 

(109) French Elle aura fait cela ‘She did that (apparently)’, L'accuse aurait tue trois 
personnes ‘The defendant (allegedly) killed three people’. 

In some languages - in Occitan varieties and in the Oltenian dialect of 
Rumania, for instance - it is held that the compound is limited to reference 
to events that have taken place on the same day as the act of speech. 
Seventeenth-century French grammarians drew up a similar ‘twenty-four- 
hour’ rule for standard French, but it is not clear how far this was reflected 
in actual usage. Usually, however, the distinction between an event that 
took place in the past and one that was completed by the present has been 
lost in colloquial speech. Western Iberian and Southern Italian varieties, I 
repeat, cleave to the simple form, whereas most others use only the com¬ 
pound form for unmarked past reference. In modem spoken French and 
Northern Italian varieties, where the compound past has definitively tri¬ 
umphed, the preterite is nevertheless sometimes retained with stylistic force 
for certain types of narrative and so carries prestige as a quaint rarity. 

We should mention the use in some varieties of the surcompose forms, like: 

(110) J’ai eu fait ‘I have had done’, for pluperfect J’avais fait or past anterior J’eus 
fait ‘I had done’. 

In French the form is regarded as non-standard, although it has a respectable 
history. Similar forms are current in those other varieties that use the com¬ 
pound perfect. 
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(e) Vocabulary 


Although nearly all common Romance words are of Latin origin, we 
should note that not all common Latin words survived as popular forms 
into Romance. One count shows that 104 out of a list of some 5,000 ‘rep¬ 
resentative’ Latin words disappeared completely from the inherited vocab¬ 
ulary. The following list gives some examples of common Latin words that 
were lost in Romance, and of their replacements. 

Romance replacements for some common Latin words 


AUXILIUM 

DISCERE 

FIERI 

HIEMS 

LOQUI 

MEMINISSE 

PUER 


RUS 


‘help’: *ADJLTARE a derivative of iuvere ‘to help’ gives nominal forms in all 
the languages. 

‘to learn’: ad-prehendere ‘to get a hold of’ gives most Romance forms. Ital. 
imparare is derived from parare ‘to prepare’. Rum. a mvafa seems to be 
derived from vitiu ‘vice’ (presumably meaning ‘to eradicate vice’). 

‘to become’: the usual Romance word is from a derivative of ‘to come’ de- 
venire. It is possible that the Rumanian fi form (‘to be’) is from a corruption 
of FIERI. 

‘winter’: the adjectival form hibernu is pan-Romance. 

‘to speak’: fabulare, narrare, parabolare, rationare are among the 
Latinate forms that have replaced the classical word. 

‘to remember’: memorare ‘to call to mind’ is the most widespread word, but 
formations from MENTE ‘mind’ and corde ‘heart’ are also frequent. SuB- 
venire ‘to come to aid’ supplies, in reflexive form, Italian and French words, 
‘boy’: French garfon is of Frankish origin, and has spread to Occitan. Span. 
mozo, muchacho, Ptg. mo(o are of uncertain origin: etyma proposed include 
words for ‘young (of wine)’, and ‘close-cropped (of hair)’. Span, chico (also in 
Ital. dial.), nino. Cat. nen, nin, Ptg. menino may be playful in origin. Ptg. rapaz 
is from rapace ‘rapacious’, manceho from mancipius ‘slave’. Cat. not and Ital. 
ragazzo are of unknown origin. Ital./ancii/Z/o is possibly a derivative of fante < 
INFANTE. Rum. tinar is from TENERU ‘tender’. 

‘countryside’: most languages use some form of campu ‘field’. Rum. prefers 
prd < TERRA. 


Many of the lost words were replaced by other forms apparently of 
Latin origin, and most were reimported in some form as Latinisms; 


( 111 ) 


French auxiliaire (AUXILIUM), bellicose (bellum), civil (civis), disciple (dis- 
CEKE), puerile (puer), rural (RUS, ruris). 
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3.4 LATIN INFLUENCE 

As we have seen, the legacy that Latin bequeathed to Romance is not lim¬ 
ited to what is handed down from generation to generation by continuous 
transmission. Indeed what look to the superficial observer the most obvi¬ 
ous shared features of the languages are sometimes quite late innovations. 
This applies to some apparent shared grammatical features, which may 
have become established in the standard languages by the work of gram¬ 
marians schooled in the Latin tradition, but most especially to ‘learned’ 
vocabulary that was incorporated into each of the languages more inten¬ 
sively at some periods than others, and which supplemented or even dis¬ 
placed inherited lexical items. 

(a) Spelling 

Even where differences of pronunciation mask the similarities between 
Romance languages, their similar spelling conventions usually enable 
speakers of one Romance language fairly easily to read another. All the 
Romance languages, even the Rumanian of ex-Soviet Moldavia (Moldova), 
use the Latin alphabet: Daco-Rumanian abandoned the Cyrillic alphabet 
in the nineteenth century, when it was asserting its individuality in face of 
its Balkan neighbours, and Moldavian switched only since the collapse of 
the USSR. 

Each of the languages developed methods of representing sounds that 
did not exist in Latin. The letter h proved a useful diacritic, as the letter was 
not sounded in Latin: thus ch could denote [tf] or [f], as in Spanish, French, 
Occitan and Portuguese (but [k] in Italian), and Ih, nh could be used for [A], 
[P], as in Portuguese. H alone could be used for the new Castilian sound 
that developed from Latin F (probably originally [4>], then [h] until it 
became silent in the sixteenth century). It must have had a similar use in 
Old French in hors < foris ‘outside’ (cf. Ital./Mora), and it was also used for 
Germanic [h] as in hdche ‘axe’ < Frankish *hapja. Today, in most varieties, 
this is, like Latin H, silent, but it prevents elision of a preceding vowel: 

(112) la hdche / I’habit < habitus ‘style of dress’. 

Other superfluous letters that are frequently brought into play are z and 
X, of Greek origin: the former is used for dental affricates [ts] or [ts] in Old 
French and Old Spanish, and [dz] or [dz] in Italian; the latter for us in Old 
French, and for [J] in Catalan. Use is also made of j and v, graphical vari¬ 
ants of i and u, to represent non-Latin sounds like [t^] and [v]. 
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Usually spelling harks back to the Latin form, even when the pronunci¬ 
ation has changed, so Spanish or Catalan //, Old French ill, and Italian gl 
should be read as [A], reflecting generalization of spelling of ancestral 
forms. French shows vowel nasalization by postposition of n, whereas 
Portuguese superposes the tilde, which was originally an abbreviation for a 
nasal consonant. Other diacritics include a miniature s - cedilla - under 
the Latin etymological letter, to indicate assibilation, as in French f or 
Rumanian f, and the Spanish tilde used in h, indicating an etymological 
geminate, pronounced as [ji]. 

Modern spelling reforms introduce other diacritics like the circumflex', 
which in French indicates the loss of an etymological letter, and in 
Rumanian the centralizing of i [4]. (Note that the older graphy d is retained 
in the name of the language roman to underline its ‘Roman’ status.) 

The spelling, in the languages that used writing in medieval times, 
reflects the various conventions used in monastic scriptoria, which were 
more used to copying Latin texts than composing vernacular ones. 
Reforms, from the sixteenth century on, were usually designed to bring the 
spelling nearer to the spoken word, often by eliminating superfluous ety¬ 
mological letters and by standardizing the graphy for a particular 
phoneme. 

Languages that saw rapid phonological change, like French or 
Portuguese, have been driven to make successive minor reforms, while 
others, like Spanish and Italian, retain more or less their Renaissance 
appearance. Always reforms have been bridled by the desire to retain a 
patent link with Latin, which, for all but Rumanian, remained the prestige 
language of religion and learning. 

(b) Grammar 

It is easy enough to point to the controlling hand of Latin in Romance 
spelling. More difficult is to estimate the extent of Latin influence on 
grammar. Naturally, metalinguistic discussion was, until very recent times, 
wholly couched in terms of the Graeco-Latin tradition. Grammarians 
would talk as if the vernaculars had nominal case systems - though the 
manifestations would usually rely on prepositions - and verbal perfective 
and imperfective systems. They would emphasize the requirement of com¬ 
plete recitable paradigms and a whole array of tenses in the verb system. 

Grammar would be seen as almost identical with morphology, and 
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inflected forms would be seen as somehow superior, more elegant than 
analytic sequences with similar functions. There would be insistence on 
accurate agreement in number and gender, as necessary to signal relation¬ 
ships of units within the sentence - even though in Romance, unlike liter¬ 
ary Latin, word order usually carries the burden of this function. When in 
doubt, traditional grammarians would turn to Latin models in order to 
describe, explain or even distort features of their vernaculars. 

Most important, they sought, for their own languages, to recapitulate 
the prestigious status of Latin, incorporating into them the values associat¬ 
ed with good Latinity - concision, elegance, clarity, musicality. In Dante’s 
term, the volgare illustre (the illustrious vulgar tongue) had to aspire to the 
standing of Latin, in order to be acceptable as a language of culture. 

How far, however, the grammarians influenced everyday usage is diffi¬ 
cult to assess. In French, it is a commonplace to point to the wide discrep¬ 
ancy between standard soigne style and racy street idiom. This may have 
existed since the mid-seventeenth century. In Italian the standard is often 
said to have remained static since the sixteenth century or even earlier: it is 
usually contrasted with spoken dialectal usage. In Spanish the standard, 
firmly based on the innovating dialect of Castile, has been regulated since 
the late sixteenth century. Standard Portuguese asserted its independence 
from Spanish a little later. In the peninsula, though, other Romance vari¬ 
eties have remained vigorous to the present day. 

The morphological similarities between Latin and Romance can, as we 
have seen, be regarded as consequential on uninterrupted development 
over time. In syntax, however, there are modern uses which one suspects 
might not have survived without the buttress of Latinate grammar. Let us 
examine three possible examples: use of the subjunctive, the ‘accusative 
and infinitive’ and negation. 

(i) Subjunctive 

As we have seen, the forms of the Latin present subjunctive mood survive in 
all the languages - however, not without a good deal of syncretism with the 
indicative. In Rumanian it is marked mainly by the special complementizer 
sd (< si). Other tenses of the Latin subjunctive were less resistant. In spoken 
French, for instance, where the present subjunctive is differentiated from the 
indicative only in comparatively few forms, the ‘imperfect’ or past subjunc¬ 
tive has ceased to be used - though still learned as a written form. 
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In all the languages a ‘past’ subjunctive derives from the Latin pluper¬ 
fect subjunctive: 

(113) (cant)-a(ui)sset > Rum., Span, -ase, Ital., Ptg. -asse. Old French -ast, mod¬ 
ern -at. Cat. -es, R-R (Engadine) -ess. 

In Spanish and Portuguese it is rivalled by forms deriving from the pluperfect 
indicative (see 7.5). In Catalan there is also a ‘perfect’ subjunctive compound¬ 
ed with the GO auxiliary. In some Francoprovenpal varieties the etymological 
past subjunctive has virtually ousted present subjunctive forms. 

In Portuguese, and in older Spanish (dying out in the eighteenth century), 
there is also a ‘future subjunctive’, deriving from what seems to have been 
a merger between the Latin future perfect indicative (e.g. -avero) and the 
perfect subjunctive (-averim) - Span, -are, Ptg -ar. It can be questioned 
whether it does indeed refer to events in the future, or rather to those that 
could occur earlier than some specified reference point of time: 

(114) Ptg. Quando chegar, eu voii sair ‘When he arrives, I’ll go’. 

In Portuguese and Galician the future subjunctive is identical for regular verbs 
with the ‘personal infinitive’, which tends to replace it in uneducated speech 
(see 4.2). In Galician a present subjunctive would be used in (114), as in Spanish. 

Some see the subjunctive as having essentially timeless reference, expressing 
the speaker’s lack of commitment to the reality of the event mentioned. 
Others point to the emotive import of the subjunctive, expressing the atti¬ 
tude of the speaker to the event he reports, or conveying optative or volitive 
nuances. In any case, differences of tense form depend automatically on 
sequence conventions, and not on real time setting of the event reported. 
Use of the ‘past’ subjunctive can in some languages imply a greater distanc¬ 
ing of the speaker with regard to the veracity of the report. 

All the core Romance languages do show indicative/subjunctive distinc¬ 
tions, however minimally. Descriptions in traditional grammars, which 
particularly cherish the distinctions, usually claim that use, with optative, 
voluntative or potential semanticism, has remained virtually unchanged 
since Latin. 

Romance now scarcely uses the subjunctive in main clauses, however. It 
survives best as an optative in person 3 ‘imperatives’: 

(115) Ital., Span. (\)Vengal ‘Let him come!’ (or polite ‘Come!’) 

and in negative imperatives in the Iberian languages: 
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(116) Span. iNo vengasl. Cat. No vingui!, Ptg. Nao venhas! ‘Don’t come!’ 

In some languages person 3 imperatives are accompanied by a comple¬ 
mentizer, in keeping with the intuition that they are governed by an under¬ 
lying verb of command: 

(117) French Qu’il vienne!. Rum. Sd vind!, Cat. Que vingui!, ‘[I command] that he 
come’. 

In general, the Romance subjunctive is a form used principally in subor¬ 
dinated clauses, reflecting some Latin uses more faithfully than others. In 
many cases it can be viewed as merely an agreement feature, a servitude 
grammaticale, which serves to reinforce the semanticism of the governing 
verb or conjunction, usually implying volition or the lack of certainty 
inherent in anticipated events. 

In other cases the choice seems one of style, with the subjunctive adding 
a certain air of elegance, to the subordinated postposition. This is particu¬ 
larly so, for some languages, after verbs indicating pleasure, anger, fear 
and the like. Sometimes, where the choice of mood is prescribed by stan¬ 
dard grammars, the actual verbs involved may differ: 

(118) Ital. Spero che venga, French J'espere qu’il viendra ‘I hope he’ll come’. 

In the Iberian languages esperar may be translated as ‘to hope, to wait for’ when 
followed by the subjunctive, and ‘to expect’ when followed by the indicative: 

(119) Span. Espero que venga I vendrd. ‘I’m waiting for him to come,’ ‘I hope he’ll 
come’ / ‘I expect him to come.’ 

With verbs of saying and thinking, where Latin used the accusative and infini¬ 
tive construction, a finite verb clause is used in Romance, and choice of the 
subjunctive adds a nuance of doubt about the subordinated statement. In 
French, hesitation in usage, even at the time of standardization of the lan¬ 
guage, led to the formulation that the subjunctive be used after the negative or 
the interrogative (but not the negative interrogative). In the colloquial lan¬ 
guage only the indicative is used. 

One construction where the subjunctive has semantic import is in restric¬ 
tive relatives, where the antecedent is understood as non-referential when 
a subjunctive follows: 

(120) Ital. Cercounaragazzache sappiad sa)ilfrancese'. Span. Busco una chica que sepa 
(habe) elframes', French Je cherche une fille qui sache (Isait) le frangais. Rum. 
Caut o fatdsd(/care) glie francez. ‘I’m looking for girl who knows French’. 

In French this sort of distinction is rarely made in the colloquial language, 
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where the subjunctive is automatically used after volition verbs, but other¬ 
wise occurs rather erratically in the usage of uneducated speakers. In 
Italian there is, to a lesser extent, a tendency for the subjunctive to be 
avoided in many contexts. In these standard languages, it may be that the 
dead hand of Latin has forced the retention of a range of uses. 

In the Iberian standards and in some non-standard dialects, on the other 
hand, the subjunctive seems to have retained more vitality in spoken usage, 
and it is difficult to say whether Latin has had the same sort of influence. In 
Rumanian, where Latin influence must be discounted, subjunctive finite 
complement clauses are used where other languages prefer an infinitive (see 
4.3); here the influence of neighbouring Greek seems to be the culprit. 

It can of course be maintained that the sort of context in which the sub¬ 
junctive is used - principally in elaborated discourse, with subordinate 
clauses - is in any case rare in unplanned everyday spoken usage. Certainly 
the subjunctive does carry with it an aura of refinement, which comes into 
play especially in more formal speech. 

(ii) Accusative and infinitive 

In Latin, clausal complements of verba sentiendi and declarandi, per¬ 
ception and saying verbs, had their subject in the accusative case and their 
verb as an infinitive. In Romance this construction survives with percep¬ 
tion verbs: 

(121) VOLUCRES viDEMUS CONSTRUERE NiDOS: French Nous voyons les oiseaux con- 
struire les iiids, Ital. Vediamo costruire i nidi gli uccelli, Span. Vemos construir 
los nidos lospdjaros ‘We see the birds build the nests’. 

It has also extended to factitives, or causatives. Where Classical Latin 
used a subjunctive subordinate facere ut faciat later Latin developed a 
FACERE EUM FACERE construction, in the sense of ‘to make him do’. This 
latter construction persists in the Romance languages (except in 
Rumanian, where a subjunctive clause or a supine construction is used): 

(122) Span. Hace escribir a Juan, Ital. Fa scrivere Gianni, Cat. Fa escriure en Joan, 
French IIfait ecrire Jean ‘He makes John write’. 

When the complement clause contains an object, its subject has to be marked 
by a preposition: 

(123) Ptg. Faz escrever a carta a/por Joao (not in Brazilian Portuguese), Span. Le 
hace escribir la carta a Juan (por is not used), French IIfait ecrire la lettre alpar 
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Jean, Ital. Fa scrivere la lellera a!da Gianni, Cat. Fa escriure la carta a en Joan 
(not per) ‘He makes John write the letter’, ‘He has the letter written by John’. 

In Portuguese however it is possible to use two direct objects (see also 4.3 for 
the personal infinitive): 

(124) Farei o Jose fechar a porta ‘I shall make Joseph close the door’. 

Complements of saying verbs in Romanee, however, normally take 
the form of a finite clause introduced by a complementizer, but in the 
modern Western standards a Latin-like infinitival construction can be 
used. As with the English equivalent, it is regarded as bookish and seems 
to date from the Renaissance period, when it was directly imitated from 
Latin. 

When the subject of the complement clause is identical with that of the 
saying verb, the infinitival construction is reasonably frequent: 

(125) French Je crois avoir raison. Span. Creo tener razbn, ‘I think I’m right’, ‘I 
believe myself to be right’. 

When the subject changes and is expressed as a pronoun, the subordinated 
verb ‘to be’ is omitted: 

(126) French Je le crois intelligent. Span. Lo creo inteligente ‘I think he’s intelligent’, 

‘I believe him to be intelligent’ (but see Ital. L'aveva creduta essere pazza ‘He had 
thought she was mad’). 

Otherwise the construction in the modern standards is favoured only 

when the subject of the complement clause is extraposed: 

(127) French lafille qu'il croyait etre venue. Span, la muchacha que el creia haber 
venido ‘the girl he thought had come’. 

In cases like these it is felt that the infinitival construction is more elegant than 
one with an accumulation of QU-conjunctions (see 7.4): 

(128) La fille qu’il croyait qui est venue-. La muchacha que el creia que ha venido. 

There can be little doubt that the extension of the accusative and infinitive 
construction to verbs of saying in the modern standards was directly influ¬ 
enced by the Latin model, in Romance as in English. However, it has 
remained, more so than in English, a literary, even precious, feature of the 
languages. 
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(iii) Adjective placement 

Language typology studies have shown that SVO languages tend to postpose 
adjectives to the noun, whereas SOV languages prepose them. Latin is, like 
other ancient Indo-European languages, a basically SOV language 
(though in main clauses surface word order is determined largely by prag¬ 
matic considerations) and the modern Romance languages are basically 
SVO. It is to be expected then that there would have been a change of 
adjectival ordering from pronominal to postnominal, in the transition 
from Latin to Romance. 

Romance adjectives do normally occur after the noun they modify, 
though with some noteworthy exceptions, especially in the modern stan¬ 
dard languages. In Classical Latin, however, as distinct from the earlier 
period, it is far from sure that prenominal adjective placement was the 
rule. In the grammar books a distinction is often made between determin¬ 
ing (or ‘emphatic’) and attributive (or ‘descriptive’) adjectives, the latter 
appearing most frequently after the noun and the former before. 

In the modern Romance standards, though much less so in non-standard 
varieties, a similar distinction is made. What is now a separate class of 
determiners normally occur before the noun, but otherwise adjectives can 
be preposed or postposed. In many cases there is a semantic difference: 

(129) French families nombreuses. Span, familias numerosas ‘large families’ / nom- 

breuses families, numerosas familias ‘numerous families’; Ital. un nuovo vestito 
‘a new (i.e. different) dress’ / un vestito nuovo ‘a (brand-)new dress’; Ptg. vdrios 
papeis ‘several papers’ / papeis vdrios ‘miscellaneous papers’. 

In literary and pretentious official style, moreover, preposed adjectives 
have become very nearly the norm. The justification given for the different 
placement is that the preposed adjective is more subjective and emotive, 
enhancing the semanticism of the noun (French doux miel. Span, dulce 
miel ‘sweet honey’) whereas the postposed adjective is contrastive {vin 
doux, vino dulce ‘sweet (not dry) wine’). The difference has been variously 
described in terms of restrictive and non-restrictive qualification, of pred¬ 
icative or attributive status, of intrinsic and extrinsic characterization, of 
reference and referent modification or of relative emphasis. 

In most of the languages a few common adjectives always are placed 
before the noun, indicating usually common scalar or affective notions - 
‘good-bad’, ‘young-old’, ‘small-large’, ‘fine, beautiful’ (bellus), ‘pitiable, 
poor’ (pauper), ‘dear’ (carus). They can be thought of as almost incorpo- 
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rated into their noun, and indeed suffixes express some of the same things 
in most languages. In modern usage lexical slippage can result in other 
adjectives being used in the same way {Air France vous souhaite un 
agreable voyage ‘Air France wishes you a pleasant journey’, perhaps influ¬ 
enced by the formulaic ^Bon voyage’). Such usage has been stigmatized as 
an erroneous Anglicism. 

Some adjectives can only with great manoeuvring appear before the 
noun. This is true in general of past participles and of ‘pseudo-adjectives’ 
in phrases like le discours presidentiel (‘the president’s speech’, not ‘the 
speech which is a president’), or un linguiste traditionnel (‘someone who 
does traditional linguistics’, or ‘a linguist who is traditional’). 

In the non-standard languages preposed adjectives, other than the half- 
dozen common ones, are rarely found (though it is to be noted that popu¬ 
lar discourse is rather short on adjectives in general). In literary language, 
extensive use is made of the stylistic device of preposing adjectives. It is 
tempting to see the stylistic habit as arising from conscious imitation of 
Latin, rather than directly inherited from Latin. 

In French there does seem to have been discontinuity in transmission, as 
Old French texts make much more use of preposed adjectives than mod¬ 
ern French. The habit of preposing is maintained in Northern and Eastern 
dialects: influence from German is usually invoked. A similar influence 
may be felt in the literary Old French: see 1.2 for the suggestion that the 
language of the earlier texts may be related to a variety more 
Germanically orientated than the modern language, which has reverted 
more decisively to the Romance type. Nevertheless the influence of Latin 
on literary style was felt from the time of the Renaissance and of stan¬ 
dardization, and preposing of the adjective became a device for tempering 
its qualifying characteristics, and especially for signalling figurative or 
emotive nuances. 

In Rumanian, though preposition of common adjectives was found in 
early texts {mare mild ‘great pity’), today this is so only in lexicalized 
forms: bundvoin{p ‘goodwill’ (though ‘poor, pitiable’ can still appear 
before the noun, biata Jara‘poor country!’). However in the modern stan¬ 
dard, presumably under French, rather than direct Latin, influence, 
preposition of other adjectives can be used with emphatic and affective 
import: zi frumoasd‘fmQ day’ but frumoasd cas'd‘lovely house’. Note that 
in cases like these a definite article is enclitic to the adjective and not to 
the noun: 
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(130) mareleprofesor ‘the great teacher’. 

The postposed adjective can be preceded by the so-called ‘adjectival article’: 

(131) calul celfrumos ‘the [very] fine horse’ (‘the horse the fine one’). 

(iv) Double negatives 

Another development that seems to date from the time of standardization 
concerns negation (for a more general discussion of negation strategies in 
Romance see 7.4). The Latin distaste for negative attraction - by which a 
negative element sparks off a string of agreeing negatives - is reflected in 
the attitudes of most grammarians of Western vernaculars. In spite of 
widespread colloquial usage (as in You ain’t seen nothing yet), grammari¬ 
ans go on insisting that two negatives make a positive as in mathematics, 
and that double negatives are ambiguous or downright incorrect. 

In Romance, where many of the negative items stand also as negative 
polarity items, there is comparatively little argumentation along these lines 
and verb negation can be accompanied by a string of negatives: 

(132) Rum. Nu ddniciodatdnimic nimdnui. Span. No da nunca nada a nadie, Ital. 

Non dd mai niente a nessmo, French II ne donne jamais rien a personae ‘He 
doesn’t ever give anything to anyone’ (‘He not gives never nothing/ nothing 
never to nobody’). 

However, in standard Italian, Spanish and Portuguese, when a negative 
item is fronted the following preverbal negative non, no or nao is deleted - 
a process called by some generative linguists negative incorporation. This is 
most usual when the negative item is the subject of the sentence (though 
the more usual construction relegates to post-verbal position): 

(133) Ital. Nessunoe venuto I Non e venuto nessuno. Span. Nadie vino! No vino nadie, 
Ptg. Ninguem veio / Nao veio ninguem ‘Nobody came’. 

It also, however, occurs when the object is fronted: 

(134) Ital. Niente ho visto I Non ho visto niente. Span. Nada he visto I No he visto 
nada, Ptg. Nada tenho visto / Nao tenho visto nada ‘I’ve seen nothing’, ‘I haven’t 
seen anything’. 

The negative incorporation rule seems not to operate in most Romance 
languages. In Catalan it is condemned as a Castilianism; in Sard and in 
central Italy, the use may be influenced by standard Italian. There are 
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signs in sixteenth-century Rhaeto-Romance and Rumanian texts that the 
rule may have operated then, but little reliance can be placed on what are 
mainly translations. 

In Spanish and Portuguese, usage without the rule is amply attested in 
earlier texts, though it roused the wrath of early grammarians. The present 
rule became firmly established in standard Spanish by the sixteenth and in 
standard Portuguese by the seventeenth century. Non-standard varieties 
tend not to observe it. 

In Italian the picture is not so clear, but it would appear that the rule 
was established by the time of the great fourteenth-century writers, and 
examples can be found in earlier texts, mainly translations from Latin. It 
does not operate in Northern varieties, but there is evidence of its use in 
Central and Southern varieties. 

Whether the construction persisted from the Latin period into Central 
Italian, or whether it was re-introduced under Latin influence is impossi¬ 
ble to tell from the evidence we have. For the Iberian languages it seems 
clear that the rule was introduced by prescriptive grammarians only at the 
period of standardization, when both Latin and Italian were exerting 
influence on grammatical thinking. It is not surprising that non-standard 
varieties do not always observe the rule, and comparatively rarely use the 
(emphatic?) fronted constructions. 

Another feature that seems to have been retained or restored by Latin 
influence is the so-called ‘expletive negative’ of the type: 

(135) Timeo ne VENiAT ‘I fear lest he come’. 

This survives in French literary style as ‘partial’ negation: 

(136) Je crains quit ne vienne. 

In older Italian (but more rarely in Spanish) no(n) was used with the 
subjunctive in complement clauses of this type, without an overt comple¬ 
mentizer: 

(137) lo temo forte non to cor si schianti (Dante) ‘I fear my heart might (not) 
break’. 

The use of a kind of negative concord, introduced by other ‘semantically 
negative’ verbs or conjunctions (including those expressing doubt) may 
have spread from this model and was particularly favoured by French. For 
the ‘expletive negative’ in clauses depending on comparisons of inequality, 
where Latin would use quam, often with a second comparative, see 7.5: 

(138) VERIOR QUAM GRATIOR POPULO ‘more truthful than (more) popular’. 
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(c) Vocabulary: Latinisms 

The vocabulary of the Romance languages is not only inherited through 
smooth transmission from Latin (popular words), but also has been (as to 
more than about 30 per cent in some languages) borrowed from Latin dur¬ 
ing the course of the languages’ history (learned words, Latinisms, 
cultismos). This is not true of Rumanian, which, except for the 
Transylvanian variety, had little contact with Latin through Church or 
learned sources. 

In standard Italian it is hard to judge, on the basis of their phonological 
shape compared with Latin, whether lexical items are popular or learned. 
As we have seen, Italian has tended to retain, or restore, words with 
Latinate shape: the invention of the literary language in the fourteenth 
century, and its standardization in the sixteenth, constantly looked back 
to the past as the model. Clearly popular forms are more current in the 
non-literary dialects than in the national language. 

To a lesser extent, the same is true of Spanish: one problem here is the 
proliferation of so-called semi-cultismos, which partake of some regular 
sound changes but resist others. The usual assumption is that they have 
been rebuilt, by association with the Latin equivalents heard continually in 
the Church services. This assumption depends, however, on a Neogram¬ 
marian model of sound change which holds that all items of a language 
are affected virtually simultaneously by a mechanical phonological shift: 
exceptions have to be accounted for within certain categories, one of which 
is that of loanwords. In some circumstances a Latin loanword is seen as 
contaminated with an existing popular word, the blend giving unexpected 
results: one much-quoted example is the word for ‘church’ from ecclesia 
(itself borrowed from Greek) which ends up in French as eglise, in Spanish 
as iglesia and in Portuguese as igreja. None of these looks like the Latin 
word, yet neither are they regular phonological outcomes, with expected 
palatalization (Italian chiesa, on the other hand, could be a regular form). 

It is in French, where there was radical regular sound change, that it is easiest to 
measure the extent of learned vocabulary. Here the fashion for borrowing from 
Latin had passed its peak by the seventeenth century, after which time the Latin 
words borrowed tend to be international, like laboratoire ‘laboratory’ etc. 

It would be a mistake to think that the term ‘learned’ implies that the 
relevant items are esoteric innovations. Indeed it is among the most com¬ 
mon words in the languages that these items occur. For less frequently 
used items (for instance, in a dictionary like Le Petit Larousse covering 
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some 50,000 items) the proportion of newer words is greater (about half 
of the current vocabulary entered the language after the sixteenth centu¬ 
ry), but these are not usually Latinisms. 

The periods which most favoured Latin borrowing in French were the 
fourteenth century (when translations from Latin were commissioned by 
royal decree) and, to a lesser extent, the sixteenth century, when interest in 
classical culture was rekindled. But learned forms are found in texts from the 
very beginning of vernacular writing. It could be that the early attested 
Latinisms are merely an effect of graphical convention (with scribes writing 
as if the language were Latin but reading in vernacular). In Spanish and 
Portuguese the greatest influx of Latinisms seems to have occurred in the 
thirteenth century, when the vernacular started to be used for a new range of 
purposes, previously the province of Latin. It has been maintained that this 
was little more than a shift in writing practice, and that ‘medieval Latin’- the 
written language of culture throughout the Middle Ages - was bom of the 
change. Those writing in the new way, though, would probably have found 
the colloquial vocabulary inadequate for the more elaborated discourses, and 
so drew on the Latin sources that were readily available to them. 

To give some idea of the extent of Latin borrowing in the most fre¬ 
quently used sections of the vocabulary, let us look at a sample of words 
in the Franfais fondamental dictionary of the 2,000 most used words in 
modern French. Taking words beginning with the letter L, and leaving out 
grammatical items, I find that 29 per cent of items are Latinisms, com¬ 
pared with 46 per cent of popular words inherited from Latin. Some of 
these Latinisms - like labourer ‘to plough’, earlier ‘to work’ < laborare - 
occur in the earliest texts. Others replace more popular forms - like lac 
‘lake’, introduced instead of lai in the late twelfth century, or legume ‘veg¬ 
etable’, a fourteenth-century substitute for leiin. Post-sixteenth-century 
introductions are rare - one certain example is litre of 1793 - though some 
of the words changed their semantic scope in modem times. 

We have already seen that Latin borrowings brought French more deci¬ 
sively into the Romance ‘club’ in the late medieval period: spelling conven¬ 
tions and revised pronunciation based on these (particularly frequent 
since universal education got underway) have prevented French from 
drifting off as a consequence of radical phonological change. 
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3,5 WHO KILLED LATIN? 

The slickest answer to that question is the second Vatican Council meeting 
in 1962, which decreed that Roman Church services could henceforth be 
in the vernacular (but note that its proceedings were published in Latin). 
But it could also have been similar decisions of ecclesiastical councils in 
the ninth century, of which the most well known is the 813 Council of 
Tours, which decreed that homilies should be delivered in ‘rustic roman’ 
speech, so that the faithful could understand them. 

The most conventional answer is that there is no culprit - Latin is alive 
and well and living in the Romance-speaking areas - that Romance is 
Latin, under another name. But we have already seen that, although the 
name Romance was used interchangeably with Latin for a long time, at 
some period the languages were recognised as different. 

But when? Some say always - the spoken idiom was recognizably differ¬ 
ent from the written (which was presumably also a spoken form for an 
elite). Others say ‘since the spread of Latin through the Roman Empire’ as 
speakers of substratum languages shifted to Latin, bringing to it some of 
the features of their first language. Others say that the Latin language 
died, as a mother tongue, when Roman administration broke down in the 
fourth century, or with the influx of barbarian hordes into the Empire. 
Others will put the demise of Latin in the eighth century, when under 
Charlemagne there was a cultural renaissance which attempted to restore 
the written language to its former pristine purity, and so divorced it from 
the current colloquial use. Yet others will see the death of Latin as dating 
from the appearance of the first vernacular texts, which can be reliably 
dated to the tenth century at the latest. Feudal social and political organi¬ 
zation could then be linked to the breakdown of communication between 
different parts of the Western Romance area. These questions will be dis¬ 
cussed in more detail in chapter 6. 

Certainly we can say that Latin went into terminal decline from the 
early Middle Ages, but it may still have been viewed as a more prestigious 
version of the language spoken by ordinary people. The educated still 
took pride in knowing Latin, and the signs are that Latin did not start 
being taught through the vernacular until later. In the twelfth century we 
start getting hints that Latin was being regarded as a foreign language, just 
as the different Romance languages were being seen as distinct one from 
the other. In the thirteenth century, administrative documents began to be 
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written in vernacular, which previously had been used mainly for enter¬ 
tainment or religious instruction. 

But Latin went on being used, in a diglossic situation, for more elevated 
purposes well into the sixteenth century. In some parts of non-Romance- 
speaking Europe, Latin was still used for administrative purposes in the nine¬ 
teenth century, when it remained everywhere a language of scholarship. In 
the sixteenth century, when there was another revival of classical standards, 
it may be that educated people were still able to follow sermons delivered 
mainly in Latin (their written versions were certainly in Latin). It is only in 
the twentieth century that Latin became virtually confined to Church func¬ 
tions (ending in the late 1960s), and was treated as an erudite sacred cow. 

One question that can be asked is: when did Latin, in so far as it was dis¬ 
tinguished from Romance, cease to be anyone’s first language? Recall that 
even for Erasmus, Latin was a second language: at the point of death he 
reverted to his mother tongue, we are told. The sixteenth-century French 
essayist Montaigne is usually quoted as the last first-language speaker. His 
father insisted (though with what success?) that even the servants should 
address him only in Latin during his early childhood. Montaigne himself, 
as a speaker both of Gascon and of French, claimed that, with his Latin as 
well, he could make himself understood in all Romance tongues (not 
including, of course, Rumanian). Later attempts to bring up children with 
Latin as their first language seem to have been doomed to failure - partly 
because of the difficulties in inventing new words for everyday objects. 

Here we should mention the idea recently put forward by Roger Wright 
that Latin did not die (it is represented by the Romance languages today), 
but that in the eighth century a new Latin was created - Medieval Latin. 
This was a consequence of the Carolingian reforms under the aegis of the 
English scholar Alcuin of York, who is said to have introduced a new way 
of reading Latin, different from the Romance tradition, in which Latin 
was read as if it were the vernacular. The claim here is that it was only 
after this time that diglossia began, with Latin being recreated as a High 
language, and Romance as the Low. Before this, speakers were not con¬ 
scious of the difference between Latin and Romance, merely of that 
between written and spoken codes. This controversial view depends on 
ambiguous evidence about the nature of the Carolingian reforms, and 
about the causes of differentiation of Latin into distinct Romance idioms. 
It also does not adequately account for the salient grammatical differences 
between the Latin of the texts and the spoken vernaculars; even if speak- 
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ers pronounced the Latin as if it were the local Romance variety, it is hard 
to believe that they were not conscious of the difference between the two. 

Many scholars now agree, however, with views that were advanced first 
in 1929 by Henri Muller, but for a long time treated with scorn, that the 
birth of the separate Romance languages - if not the death of Latin - 
dates from a much later era than the one suggested by the political col¬ 
lapse of the Roman Empire. For a long time speakers continued to view 
Latin as the codified, fixed version of their own more homespun everyday 
usage, which they quite naturally accepted as varying from place to place 
and from person to person. Recognition by each community of the dis¬ 
tinctness of its language presumably went hand in hand with the percep¬ 
tion of its collective identity. Latin was killed, then, by Romance speakers 
who perceived it as dead, beyond their reach, and superseded by a new 
prestigious language that could adequately fulfil all the functions previ¬ 
ously performed by Latin. The last blow then must have come from the 
Vatican Council, but the death had lingered on for many a century. 


FURTHER READING 
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Rohlfs (1984). On Latin morphology, Carstairs (1988); on Latin syntax and 
semantics, Panhuis (1982), Pinkster (1990); on Late Latin, especially 
Vaananen (1981,1987), Herman (1990), Herman ed. (1987), Calboli ed. (1989, 
1990). For the Roraance-Latin interface, Wright (1982), Dahmen et al. eds. 
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(1991). On accentuation, Pulgram (1975), Mayerthaler (1982). On word order, 
Bauer (1992), on case and nominal inflection, Gaeng (1984), Schosler (1984), 
Blake (1994). On umlaut, Leonard (1978), Maiden (1991). On negation, 
Haegeman (1995), Rizzi (1982), Schwegler (1990). On Romance verbs, Vincent 
& Harris eds. (1982), Pountain (1983). On auxiliaries, Harris & Ramat eds. 
(1987), Heine (1994); on compound tenses, Harre (1991). On tense, Comrie 
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the definite article Trager (1932), Abel (1971). 



4 


Convergence, interinfluence and 
parallel development 


4.1 DRIFT AND METARULES 

Where the Romance languages share idiosyncratic features classic method¬ 
ology dictates that their origin is sought in Latin, or at any rate in Proto- 
Romance. It is true that shared morphological and lexical features can 
often be traced back to Latin sources, but non-universal syntactic features 
may stem from ‘peripheral’ morphological or phonological accidents. 
However, we should take into account the considerable interinfluence 
between the different Romance languages throughout their history, dis¬ 
cernible most easily in the lexicon and in metalinguistic discourse, but also 
extending to syntax. 

One little-understood process that may affect the history of related lan¬ 
guages is drift, by which they independently develop in the same direction. 
An underlying, but often unstated, supposition here may be that the 
proto-language system was in some way unstable and that each language 
at its own rate, and in parallel but perhaps subtly different ways, sought to 
remedy any imbalance. An objection to this way of looking at things is 
that it seems to imply some sort of teleology, a striving for equilibrium, on 
the part of the language system. However, it can be claimed that ‘drift’ 
does not imply ‘remediation’ - for instance, the loss of noun-case mor¬ 
phology in English and French can be the result of non-functional drift 
(or ‘slope’, ‘slant’), merely in the sense that at successive points of time 
there is less case morphology about. 

Parallel developments, especially in phonology, can also be described by 
means of metarules, which are always potentially operational in the lan¬ 
guages, but are sometimes in abeyance, perhaps because input has been 
‘bled’ away by other changes, or because they are blocked by conservative 
social forces. The examples of palatalization, discussed in chapter 3, may 
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fit into this category - as a natural assimilatory process, it keeps popping 
up in a slightly different form all over the place from time to time. 

The concept of drift is compatible with the functionalist view of lan¬ 
guage, and indeed with the invisible hand approach, originating in Adam 
Smith’s view that individual attempts at maximizing advantage while mini¬ 
mizing effort can result in a collective directional shift. In similar circum¬ 
stances it will be likely that the outcome in different languages will be 
alike, and that differences could be attributable to extraneous factors. 

Is it also compatible with the view that new learners of a language, faced 
with excessively complex and degenerate input data, consequent on an 
accumulation of anomalies in the output of their seniors, utilize their lan¬ 
guage acquisition device (or universal grammar) to create a simpler, more 
elegant and consistent grammar that will incorporate the aberrant materi¬ 
al? Presumably only when in all the related languages there are similarities 
in the less-than-systematic material in the adult output would the newly 
created grammars come up with the same solutions. 

So we must again look to inherited peripheral phenomena, to the weight 
of tradition, to interinfluence and common external influences, to account 
for parallel developments. What all languages would have in common is 
the tendency to generalize and simplify rules to bring them into line with 
more basic principles, and probably to slough off tiresome features, espe¬ 
cially arbitrary lexical constraints, that have hung on only through inertia 
and tradition. Universal tendencies may go deeper than that, though, per¬ 
haps targeting more transparency in derivation or more iconicity, relating 
surface forms more closely to underlying semantics. 

Romance languages, like most other Indo-European languages, do seem 
to show a drift from heavily inflected synthetic word morphology towards 
more analytic structures with free units syntactically signalling grammatical 
relations. The progression has been viewed by theorists as either indicating 
decay and decadence from an earlier Golden Age, or as heralding progress 
in unravelling the parallel but tangled threads of form and substance. 

I have already pointed out that in Romance the analytic process is car¬ 
ried further in the nominal system than in the verbal. But even here there is 
development of what are traditionally called compound (‘composed’) and 
periphrastic forms, especially to mark aspectual features. More conserva¬ 
tive languages resist this trend which some see as an inexorable, though 
erratic, forging ahead towards a creole-like system of TAM marking (see 
2.2). In this framework of thought, the term frangais avarice was intro- 
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duced in the between-wars period to characterize a sort of French that had 
‘advanced’ far along this path in the vanguard of Romance. It was seen as 
a popular colloquial variety untrammelled by reactionary traditional 
grammar and pointing the way to the future of Romance. 

In this chapter I shall examine some of the possible candidates for the 
status of ‘convergent’ Romance developments. Except for lexical inter¬ 
change, they are all controversial and are most often seen as having their 
roots in the proto-language. I shall be making the case for seeing them as 
one facet of ‘the club effect’, by which the Romance languages have stuck 
together as a federation. I shall pluck examples from the realms of phonol¬ 
ogy (diphthongization), syntax (the infinitive in complementation and 
subordination, and object clitics), morphology (periphrastic forms, paying 
special attention to the passive and the future) and lexicon (mutual bor¬ 
rowing between the languages). 

4.2 DIPHTHONGIZATION 

So-called ‘Romance diphthongization’ is almost universally believed to 
date from Proto-Romance. Differences between the languages are usually 
associated either with divergent early chronological ordering or ascribed 
to late developments in the individual languages. However, it is possible to 
postulate parallel rather than common development. 

The type of diphthongization in question is that of Latin short fi and 6 
in stressed syllables. By the rules of Latin accent assignment (see 3.3) this 
covers the only accented vowel in monosyllabic or bisyllabic words, and 
the antepenultimate vowel in words liable to syncope. It will be recalled 
that classically it is assumed that, whereas in Sard all short and long vow¬ 
els merged, and in Rumanian short and long o merged, in most of the rest 
of the Romance area fi and (5 would be retained as the only short vowels 
that were distinct from their long counterparts (I and C having merged 
with E and o). They came to be, however, distinguished from their erstwhile 
long partners not by length, but by lower point of articulation. 

In traditional accounts the erstwhile short vowels are assumed to have become 
‘lax’ in Proto-Romance. Labov (1994) argues, on the contrary, that Romance 
vowels are consistently ‘tense’ (or ‘peripheral’ in his terms). Thus, unlike 
Germanic ‘non-peripheral’ vowels, Romance vowels tend to rise during chain 
shifts. Moreover the ‘escape route’, by which high peripheral vowels may 
switch to the non-peripheral ‘track’, is blocked in Romance: thus Romance 
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high vowels (especially [i]) tend to remain stable in Romance, whereas in 
Germanic they may diphthongize and lower. This would provide a partial 
explanation for the apparent limitation of Romance diphthongization process¬ 
es to mid vowels (but see also 6.4). 

Proto-Romance [e] and [o] resulting from short Latin fi and 6 are thought 
to have tended to diphthongize under stress. Reflexes in Romance vary but 
from some a diphthongized form can be reconstructed: 

(1) MfiL ‘honey’: Rum. miere, Ital. miele, French miel. Span. miel\ nOuu ‘new’: Ital. 

nuovo. Old French nuef. Span, nuevo. 

Such diphthongization can be seen in terms of raising of the first mora of 
the nucleus. 

Diphthongization of £ is attested in spelling very sporadically in Late 
Latin (e.g. niepos for nepos ‘nephew’ in an inscription dated AD 120) and 
is thought to date back to just after Sard lost contact with innovations in 
Latin. It is impossible to tell whether the digraph represents a nucleus with 
an onset glide ([je]) or change of point of articulation during pronuncia¬ 
tion ([ie]), nor whether it was a heavy (two-mora) or a light (one-mora) 
diphthong. Given parallels in some modern dialects (e.g. in Sicily), it has 
been suggested that the diphthong was a phonetic variant of the short 
vowel used in emphatic position. 

Diphthongization of short 6 is traditionally dated to a rather later peri¬ 
od, partly because Rumanian did not apparently participate in the process. 
The graphical form (mo), attested in a fifth-century inscription and then in 
one of the earliest French texts (buona for bona in the Eulalia poem of the 
ninth/tenth century), suggests raising, perhaps closing to the glide [w], or 
non-syllabic [u], at the beginning of the sequence. 

The standard explanation of this type of diphthongization (usually 
called ‘spontaneous’) dates back to Schuchardt (see page 4), who relates 
it to later diphthongization of Latin E and o in French (see 6.4). It is 
assumed that, with the loss of the phonological Latin quantitative system, 
vowel length was determined by phonetic factors: stressed vowels in free 
syllables were lengthened. The lengthened vowel could then suffer dissimi¬ 
lation ([ee] > [ie], [oo] > [uo] > [ue]). The assumption made is that ie and uo 
were heavy rising (though down-gliding) diphthongs with stress on the 
first element. This is rather puzzling as it is expected that the initial part of 
the heavy diphthong would retain its original timbre, with the final part 
tending to become a glide (a falling but up-gliding diphthong). 
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There is some evidence for the postulated stress patterning: in Old 
French verse ie and uo can assonate with i and u, and in some dialects 
today there is a (possibly metaphonic) ['io] diphthong derived from 6. But 
in most languages where the diphthongs persist they have an onset glide 
(/je/ or /ja/, /wo/, /we/ or /wa/; see also 7.2). 

Another difficulty about the (‘early’) Schuchardtian explanation is that 
in some languages - notably Spanish - the diphthongization occurs in 
checked (blocked or heavy) syllables as well as in free (light) syllables: 

(2) HERBA ‘grass’: Rum. iarbd, Dalmatian (Veglia) yarba. Span, hierba, R-R 
(Surselvan) jarva I French herbe, Ital. herbal FORTE ‘strong’: Span, fuerte, 
Dalmatian (Veglia) fuart I French fort, \ta\. forte. 

This fact has to be explained away by appeal to analogical spread of the 
diphthong. It could, however, suggest that the diphthong was originally 
light (one mora) like the original vowel, and that the effect of stress was to 
favour a shift of articulation, rather than lengthening. 

(a) Metaphony 

Another quite different explanation for Romance diphthongization (also 
originally proposed by Schuchardt) is that it originates as an assimilatory 
metaphonic (umlauting) phenomenon, sometimes called ‘conditioned 
diphthongization’. This can be related to breaking, by which a short fi or 0 
is partially raised by a juxtaposed palatal consonant or glide. This type of 
diphthongization is well attested in French: 

(3) vieil old’ (old [vjeX]) < veclu (for VETULUS),ye«i7/e ‘leaf’(old [fueA9])< folia. 

In Spanish and Italian, though, a palatal consonant sometimes has the 
effect of impeding diphthongization: 

(4) Span, espejo ‘mirror’ (Old [espejo]) < speclu (for speculum), Ital. meglio ‘bet¬ 
ter’ < MELIUS, Span, hoja (Old [hosa]), Ital./og//a < folia. 

We have already mentioned the contention that Proto-Romance was an 
umlauting dialect, and the widespread Italian morphologization of 
umlauted forms, paralleled to some extent in Asturias in Spain. An I, and 
sometimes a C in the post-tonie syllable, eould induce raising of the tonic 
vowel. Normally this means that [e] becomes [e] or [i], and [o] becomes [o] 
or [u]. However, sometimes the same features are realized by a transition 
[ io ], or [je], and [wo], less often ['uo] (not apparently [we]): 
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(5) Ital. dial. (Chieti, Abruzzo) [bbiajja], [bbuans], (Ischia) [bbjelb], [bbwona] 
‘beautiful, good’ < b6llu, bOnu. 

This process seems to occur in both free and checked syllables, and, as 
vowel length is not clearly phonological in these varieties, there is no evi¬ 
dence that the resulting diphthongs are heavy. 

This type of diphthongization could have been widespread in early 
Romance, though masked in some languages by later developments. For 
instance, it is possible to explain Portuguese tonic [e] and [o] < 6, 6, in syl¬ 
lables followed by [o], as due to levelling of early metaphonic [ie], [uo], or 
in terms of vowel harmony in more modem times. The anomalous 
Catalan [e] < E, compared with [c] < E, in many dialects (an apparent 
phonological flipflop) may be described in terms of generalization of [ie] < 
£, permitting the lowering of [e] < Eto [ej. Once again there is difficulty in 
explaining the spread of the diphthong to all or nearly all lexical forms, in 
Spanish, for instance, or the restriction of the diphthong to Latin free syl¬ 
lables in French and Italian. 

For Asturian the metaphony may be independent of spontaneous diphthong¬ 
ization: 

(6) [gwinu] / [gweno] ‘good’ (count / mass) < bonu / bOno. 

For French and Italian we have to postulate a process by which the light 
diphthong is reinterpreted as equivalent to a long vowel, which can occur 
only in a free syllable. In French this fits in well with other changes - like 
diphthongization of Latin E and oin free syllables (see 6.4), and with com¬ 
pensatory lengthening consequent on loss of syllable-final consonants or 
on diphthong levelling. In Italian it is less convincing, though here there is 
some evidence of vowel lengthening in stressed syllables (alternating 
with consonant gemination - see 6.3). Another alternative explanation 
advanced is that metaphonic raising originally occurred only in free sylla¬ 
bles and then spread to all syllables in some varieties. This story seems to 
assume that the metaphonic diphthongs were originally heavy nuclei, and 
not, as I have suggested, light nuclei. 

In standard Italian the diphthong, especially uo, is often called ‘mobile’ - ‘now 
you see it, now you don’t’: 

(7) pruovalprova ‘proof’, also nuovo ‘new’ < NOUU / nove ‘nine’< nouem; tiene ‘he 
holds’ < TENET / bene ‘weir< bene, leva ‘he raises’< levat / lieve ‘light’ < LEVis. 

This variation is usually explained in terms of dialect mixture, of morphologi- 
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cal levelling, or of learned versus popular development. It could, however, be 
related to the differential lengthening under stress, though with a good deal of 
subsequent lexicalization. 


(b) System pressures 

Mention should also be made of the structuralist explanation of Romance 
diphthongization as a result of the imbalance in the vowel system conse¬ 
quent on monophthongization of Latin AU and AE. This would have trig¬ 
gered off a ‘push-chain’ reaction which had the effect of taking many 
reflexes of inherited fi and 0 out of the simple vowel system and establish¬ 
ing a sub-system of complex nuclei. This works best for French, where the 
medieval language did have a whole set of diphthongs and triphthongs, 
most of which levelled by the modern period. It also accounts neatly for 
apparent non-diphthongization in languages where AU was not levelled, as 
in Rumanian, Portuguese, Occitan. On the whole the structuralist ideas 
about symmetrical systems, cases vides, functional load and push and drag 
chains are currently unfashionable (though see Labov 1994) and explana¬ 
tions in terms of internal systemic pressures are out of favour, as smacking 
too much of teleology. However, certain sound shifts can conceivably be 
seen as consequent on others, where potentially pathological phonological 
mergers threaten. 

(c) New approaches 

In current multilinear phonological theory, explanations of diphthongiza¬ 
tion and diphthong levelling are sought more in terms of differential dis¬ 
tribution of features at different tiers. Thus a heavy diphthong can develop 
from a long vowel as a result of divergence between the skeletal tier and 
the segmental tier, with delinking of the second mora from the skeletal tier 
and then raising: 

V V V V 

V - 1/ I 

e e i 


More catastrophically, it could become a light diphthong by merger at the 
skeletal tier and divergence at the segmental tier. 
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V V 

V - 


e 


A 


Features spelt out sequentially in a diphthong can also come to be concur¬ 
rent in a single segment to form a new monophthong: thus [we] or [ew] 
can, as in French, be monophthongized to [ce], merging the labio-velar and 
the palatal elements into a new mid, or retracted front, vowel. 


(d) Why £ and 6? 

Such changes can occur in any language and at any time, so that there is no 
need to postulate common origin for the parallels between Romance. What 
perhaps we could ask is why Latin £ and 5 were particularly prone to diph- 
thongization. Perhaps the answer is to be sought in the fact mentioned 
before: these were the only two short Latin vowels that survived as distinc¬ 
tive phonemes into most Romance languages. For these two vowels, which 
frequently occur in accented syllables in Latin, there was a certain mismatch 
between their inherent short quantity and their tendency to lengthen, espe¬ 
cially in free syllables, under stress. In unstressed syllables, they everywhere 
merged with reflexes of their long counterparts. In stressed syllables, howev¬ 
er, speakers may have tended to emphasize their differences from close [e] 
and [o], reflexes of E,I and o, tJ. Diphthongal variants may already have been 
familiar from metaphonic forms, and could have been extended to other 
contexts, perhaps not developing as distinct phonological units until well 
into the Middle Ages. That diphthongs were not wholly divorced from their 
monophthongal counterparts may be shown by the widespread medieval 
apophony that sometimes persists into modern verbal paradigms, differenti¬ 
ating stressed from unstressed vowels in the same lexical stem: 

(8) French Je leve (Old lief) / nous levons < LfiVARE ‘to raise’, Spanish muevohnove- 

mos < mOvere ‘to move’, Ital. suonolsoniamo < sOnare ‘to sound’. 

Each language made use of the possibilities foreshadowed in Proto- 
Romance in different ways and the related but varied outcomes could have 
been arrived at independently. 
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4.3 THE INFINITIVE 

The Latin infinitive was originally a verbal noun, but in some varieties it 
came to act as a substitute for a finite verb. In most Romance languages the 
infinitive has tended to extend its verbal functions, and to become the canoni¬ 
cal verbal form - hence its use as an unmarked form in pidgin and creole. An 
apparently more colloquial Latin use of the infinitive of narration (historical 
infinitive) and of exhortation continues into Romance. The latter is found 
mainly in modem officialese, but also in negative person 2 commands: 

(9) Old French Net dire ja!, Ital. Non to dire! ‘Don’t say it!’ 

(a) Complementation 

We have already discussed the Latin accusative + inflnitive construction 
(see 3.4). In some contexts (for instance, after verbs of wanting) this 
rivalled use of a finite clause introduced by a complementizer, like UT. 
What tended to happen, especially in poetic and later Latin, was the exten¬ 
sion to such contexts of the use of the so-called prolative infinitive, in 
which the infinitival noun used as the object of a verb was understood as 
representing a completive clause whose unexpressed subject is identical 
with that of the matrix clause (so-called Equi-NP or control construc¬ 
tions). This contrasts in most Romance languages with the clausal comple¬ 
mentation, introduced by a QU-complementizer, that is required when the 
subordinate subject is different from that of the main clause: 

(10) French Je veux le faire / Je veux qu’il lefosse', Ital. Voglio farlo I Voglio che lo 
faccio'. Span. Quiero hacerlo / Quiero que lo haga ‘I want to do it’, ‘I want him 
to do it’. 

For clitic climbing in such examples see 6.5. 

In most Romance languages a prepositional complementizer (from ad or 
de) can introduce a dependent infinitive: 

(11) French apprendre a lire, outlier de faire', Ital. imparare a leggere, dimenticare di 
fare'. Span, aprender a leer, ohidarse de hacer ‘to learn to read’, ‘to forget to 
do’; aider quelqu’un d faire quelque chose, aiutare qualcuno a fare qualche cosa, 
ayudar alguno a hacer algo, ‘to help someone (to) do something’. 

In most Romance languages an infinitive is preferred to a finite clause 
when the subordinate subject is the same as that of the main clause, or 
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when the subject of the subordinate verb is the object of the main verb. 
Otherwise a finite clause, introduced by a complementizer, is preferred. 
Often the complementizer is closely related to a preposition, with the addi¬ 
tion of, usually, a QU-form for a clause and, sometimes, de for an infinitive: 

(12) French avant (quetde) / Ital. priina (chetdi) / Span, ante (-^ de que I de) ‘before’; 

apres (que), dope (die / di) / despues (de que / de) ‘after’, sans (que), senza (die), 
sin (que) ‘without’. 


(b) Noun or verb? 

The grammaticalized distinction between infinitival and clausal subordi¬ 
nation appears to have become established quite late in the history of the 
languages. During the Middle Ages the infinitive still acted more like a 
noun. In the modern languages infinitival constructions are often in com¬ 
petition with subjunctive finite clauses, and are seen, especially in French 
and Italian, as a way of avoiding the subjunctive. Sometimes they are 
slightly bookish in flavour, avoiding clumsy morphological complexities 
associated with finite clauses: 

(13) French II luifaut le savoir / Ilfaut qu’il le sache ‘It’s necessary for him to know 
it’. 

In modern Rumanian, on the other hand, the inherited infinitive form 
(e.g. cintare ‘to sing, singing’) is used only as a noun, whereas a ‘short’ 
form has more verbal functions. This form is preceded for citation by a 
prepositional infinitive marker - a cinta - but is otherwise sometimes indis¬ 
tinguishable from person 3 finite forms: 

(14) Nuface!‘V>on\AoV! aface\ voinyhce‘I shall do’. 

In some types of complementation the short infinitive can be used in mod¬ 
ern colloquial speech as an alternative to a subjunctive clause: 

(15) 7(7 pot spune / Pot sd p' spun ‘I can tell you’. 

But on the whole Rumanian, like some Southern Italian dialects, has not 
favoured the more ‘Romance’ use of the infinitive in ‘control’ construc¬ 
tions, preferring a subjunctive clause introduced by a special complemen¬ 
tizer (Rum. sd. Southern Ital. dial, mu) 

The older Rumanian evidence suggests that a long (nominal) form 
could be used in complementation contexts: 



Convergence, interinfluence and parallel development 


165 


(16) Au inceput a zidire biserica (1574) ‘They have begun to build the church’ / 

(modem) au inceput sdzideascdbiserica . that they should build .. 

(cf. INCEPERUNT AEDIFICARE BASILICAM). 

Whether the Rumanian reluctance to use the infinitive as a verbal form 
continues colloquial Latin usage is disputable. Some see it rather as a 
Balkan regional feature, perhaps influenced initially by Greek, which has 
also lost the infinitive (see 8.6). The regional distribution in the non-use of 
the infinitive lends support to this hypothesis, as dialects south of the 
Danube seem to have carried the process further than more northerly 
dialects. The Southern Italian dialects that make little use of infinitival 
complementation could likewise have been influenced by neighbouring 
Greek-speaking colonies. 

All varieties of Rumanian differ from the other languages in preferring 
other non-finite forms to the infinitive in constructions like; 

(17) Mjor defdcut ‘easy to do’ (French facile dfaire, \\3\. facile a fare, Span.^aaV de 
hacer); A terminat de scris romanul ‘He has finished writing the novel’ (French IIa 
termini d'ecrire le roman)', magma de spdlat ‘washing machine’ (French machine a 
layer)', II vd viind ‘I see him coming’ (Je le vois venir, Lo vedo venire, Lo veo venir). 

In modern literary use infinitival constructions appear where sd clauses are 
more colloquial: 

(18) Mi-e ru^ine de a-(i spune I sd-[i spun ‘Fm ashamed to tell you’; Mai bine a munci 
(/ sdmunce^tt) in loc de spune (/ decit sd spui) ‘It’s better to work (/ that you 
should work) instead of talking (/ that you should talk)’. 

Such uses are usually viewed as influenced by the standard Western 
Romance languages. 


(c) The personal infinitive 

In Portuguese and Galician (also in some South-Central Italian dialects 
and in Sard, and in older Western Spanish varieties) the ‘infinitive’ (and 
indeed other non-finite forms) can take on verbal characteristics to such 
an extent that it carries person inflections (though not for persons 1 and 3, 
which require use of a pronoun). The personal infinitive (also designated as 
‘inflected’ or ‘conjugated’) is in many contexts a social and stylistic variant 
of the bare infinitive, principally used where there is a desire to emphasize 
the subject of the action: 
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(19) Ptg. O livro epara as crianfos lerem ‘The book is for the children to read’; 
Galician antes de chegarem os teas amigos ‘before your friends come’. 

Standard grammar condemns the use of the personal infinitive after auxil¬ 
iaries and modals, but popular usage in the Iberian peninsula sometimes 
flouts the prohibition: 

(20) Podeis comerdes ‘You can eat’; Vas a saberes o resto ‘You are going to know the 
rest’. 

With causatives and perception verbs, the personal infinitive can be used 
when a lexical noun phrase precedes the infinitival form: 

(21) Vi os cavalos correrem / Vi correr os cavalos ‘I saw the horses run’. 

Whether indeed such forms are truly infinitives, rather than finite subjunctive 
forms, has been disputed. The subject of an infinitive is also often expressed 
with an overt pronoun in Spanish: 

(22) al llegar tu ‘at your arriving’; iPagar yopor eso! ‘Me pay for that!’. 

In non-standard varieties of Brazilian Portuguese and Northern French we 
find also postprepositional infinitives with a disjunctive pronoun subject: 

(23) para mim leer, pour moi lire ‘for me to read’. 

(d) Conflicting tendencies? 

It is hard to discern in most of the developments described consequences of 
interaction between the languages. It looks more as if there were two conflict¬ 
ing tendencies inherited from Latin, one more archaic, by which the infinitive 
retained more nominal characteristics and verbal subordination was 
expressed in a finite clause (introduced perhaps by quod), and another more 
innovating one which stressed the verbal characteristics of the infinitive form. 
The Rumanian long infinitive and Portuguese personal infinitive, in this view, 
represent opposite poles. On the other hand, it is possible to view the indeci¬ 
sion as a consequence of the ‘squishy’ nature of the infinitive, sitting some¬ 
where on the continuum between noun and verb. Case-marked infinitives are 
attested in other Indo-European languages, so here we may view the hesitant 
behaviour of the Romance infinitive as perhaps derived from the properties 
of the category itself, rather than as a peculiarly Romance phenomenon. 

Creole use of a single lexical verb form derived from the infinitive has 
resolved the conflict, coding verb person, tense, mood and aspect by use of 
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detachable particles, and taking to the extreme the analytic tendencies that 
are less clearly discernible in the Romance verb. We have already an exam¬ 
ple of this in the development of the compound perfect. We shall also 
examine the morphological status of other Romance periphrastic forms - 
including the passive and the future - to try and unravel what is inherited, 
what results from interaction and reaction, and what can be attributed to 
‘drift’. But we shall first look at another ‘hot topic’ in Romance syntax. 

4.4 OBJECT CLITICS 

The syntactic question that has probably provoked more discussion in 
recent years than any other within the Romance sphere is that of object 
pronouns and their ordering relative to the verb. We have already seen that 
the ‘discourse’ pronouns derive mainly from their Latin counterparts, but 
disjunctive or full forms can differ from conjunctive (weak or clitic) forms in 
a way that usually can be related to rhythmic differences at an early period, 
when the development of stressed and unstressed vowels diverged. We 
have also seen that anaphoric third-person pronouns, derived from Latin 
demonstratives, can be related to the definite article. The only ‘core’ 
Romance language that uses only disjunctive pronouns is Surselvan 
Rhaeto-Romance, but there is evidence that the loss of clitics in that 
dialect is recent, possibly under German influence. Colloquial Brazilian 
Portuguese also prefers disjunctive pronouns to clitics; this is often seen as 
a sign of creolization. 

The Italian person 6 pronoun loro is always disjunctive; as a dative it is thought 
to be borrowed from French and substitutes in more formal speech for clitic 
gli, which does not distinguish singular from plural. Loro is sometimes inter¬ 
posed between auxiliary and participle (Ho loro parlato T have spoken to 
them’). 

What is most striking about Romance object clitics is not so much their 
form, but the fact that in most languages they precede the finite verb, 
whereas full lexical noun objects follow the verb (SVO being the normal 
ordering). It has been suggested that the clitic ordering represents a fos¬ 
silized left-over from earlier Latin (and Indo-European) SOV ordering, in 
the clause. The historical evidence lends comparatively little support to the 
hypothesis, however. Within the ‘Principles-and Parameters’ framework 
there have been numerous attempts to account for the Romance data in 
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terms of parameter setting, usually assuming different constraints on 
movement of the verb, or on agreement features. 

For Romance object clitic arrays, subject to strict ordering constraints in the 
standard languages, see 6.5. 


(a) Clitics as agreement markers 

Romance philologists, from the beginning of the discipline, have favoured the 
idea that the object clitics represent agreement markers on the verb - echoing 
the person, and sometimes case and gender, of non-subject arguments. Diez 
(see pages 3-4) drew a parallel with Basque, and today many Romanists (some 
using the term objective conjugation) maintain that erstwhile clitics are today 
grammaticalized as verbal affixes. Proponents of this view point to the wide¬ 
spread occurrence of clitic copying or doubling, by which a full lexical object 
and a clitic can both appear in the same clause, and see the clitic as an agree¬ 
ment particle, rather than as a pronominal reduction of a lexical object. 


(b) Clitic doubling 

This was condemned as repetitious and pleonastic by tone-setting arbitra¬ 
tors at the time of standardization of most languages (most strongly in 
French and Italian), and so it is attested mainly in non-standard and 
dialectal usage. 

Everywhere the ‘expletive pronoun’ is characteristically used with left disloca¬ 
tion, a frequent colloquial construction: 

(24) French Jean, je I’ai vu, Ital. Giovanni, I'ho veduto ‘John, I’ve seen him’. 

In some varieties there is no intonational indication that similar sequences are 
indeed left dislocations, rather than simple clauses. 

In standard Spanish grammar use of the ‘redundant pronoun’ is ruled 
obligatory with disjunctive pronoun objects: 

(25) La vieron a ella pero no a mi ‘They saw her but not me’ 

and virtually so with definite objects introduced by a (including ‘preposi¬ 
tional accusatives’- see 3.3): 

(26) Le conviene a Juan ‘It suits John’; Lepuso un nuevo conmutador a la radio ‘He 
put a new knob on the radio’. 
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In some Latin American varieties (especially in Quechuan-speaking areas) 
non-gender-marked lo is habitually used to copy any noun object. 

In contemporary Rumanian, clitic doubling is very frequent, and con¬ 
structions which would be condemned as vulgar elsewhere are acceptable 
in the standard: 

(27) Calulpe care l-am cumpdrat ‘The horse that I bought [it]’. 

In some varieties a proclitic can even be doubled by an enclitic: 

(28) Rum. (Transylvania and Moldavia) L’am vdzutu-l ‘I’ve seen him [him]’ (cf. R-R 
(Friulian) la dit-i ‘I have told him [him]’). 

Clitic doubling is attested fairly rarely before the early modern period. In 
Portuguese today it is mainly confined to contexts where an emphatic dis¬ 
junctive pronoun object refers to a human being: 

(29) Disse-mo a mim ‘He said it to me’. 

(c) Other pointers 

The non-pronoun status of clitics is also suggested by the various uses of 
the ‘reflexive’ clitics, for instance in middle/passive (see below), ‘imperson¬ 
al’ (see 6.5) constructions and in ‘inherent (or lexicalized) pronominals’. 

These latter are found in all the languages (see pages 20-3). They are intransi¬ 
tive verbs with reflexive form but no discernible reflexive meaning: 

(30) se souvenir, acordarse, recordarsi, a j/ aminte ‘to remember’; se battre, battersi, a 
se lupta ‘to fight’ etc. 

The pronominal status of the clitics is also cast into doubt by the tendency 
of popular speech to use a string of clitics, without any clear relation to 
arguments of the verb, especially as ‘datives of interest’ or ‘solidarity pro¬ 
nouns’. The so-called ‘adverbial clitics’ in some languages (see 6.5) also are 
difficult to square with the equation of clitic with pronoun. Another possi¬ 
ble indicator is the use of a neuter clitic as a substitute for a deleted or dis¬ 
located adjective (not usually in Portuguese or Rumanian): 

(31) French (Jeune) it Vest, Ital. (Giovane) lo e. Span. (Joven) lo e ‘(Young) he is [it]’. 

In non-standard varieties the clitic can mark gender in such constructions, so 
that, to a question, a female speaker might reply: 

(32) French (Etes-vous contente?) Je la suis ‘(Are you happy?) I am [f.]’. 
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(d) When did cliticization begin? 

In tune with the reeeived wisdom that all shared Romance features must 
originate in Latin colloquial usage, it has been maintained that an increase 
in the frequency of pronoun usage in the later Latin texts indicates that 
cliticization was already under way. However, the evidence can also be read 
as suggesting that the modern state of affairs dates from the late medieval 
period at the earliest. 

In the older languages ‘weak’ and ‘strong’ pronouns were variants 
depending on difference of emphasis: the unstressed object forms were often 
attached enclitically to an initial-of-clause word - whether the verb, the 
subject or another stressed word - and could be, in these last cases, separat¬ 
ed by other elements from the verb. This is a pattern familiar in other Indo- 
European languages, by which an atonic element cannot begin a clause 
(Wackernagel’s Law is paralleled in Romanee by the Tobler-Mussafia Law). 

(e) Portuguese enclisis 

Enclisis to the sentence-initial finite verb, still marginally possible in liter¬ 
ary style in Italian and Spanish, remains the rule in Portuguese (also in 
Galician): 

(33) Disse-mo ‘He said it to me’ / Nao mo disse ‘He didn't say it to me’. 

Enclisis is regular even with auxiliary verbs in standard Portuguese: 

(34) Tinho-o estudado a fondo ‘I have studied it thoroughly’. 

In colloquial Brazilian Portuguese, where subject pronouns are commonly 
used, such examples are regarded as characteristic of careful style, and the nor¬ 
mative rule is often violated in speech. 

In modern European Portuguese and Galician enclisis of the pronoun has 
been extended to any positive main verb preceded by a non-quantified def¬ 
inite lexical noun subject: 

(35) Os rapazes ajudaram-me ‘The boys helped me’ (but Todos os rapazes me aju- 
daram ‘All the boys helped me’). 

Enclisis of this latter sort is sporadically attested in Old Portuguese (before 
the fifteenth century). It is usually assumed that the lexical subject in such 
constructions was emphatic, perhaps topicalized from an underlying postver¬ 
bal position. Certainly in Old Portuguese VS ordering is frequent and in the 
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modem European variety the linear position of the subject seems to be 
dependent on pragmatic, stylistic or rhythmic, rather than syntactic, factors. 

In modem normative grammar postposition of the pronoun is seen as presti¬ 
gious, and examples of hypercorrection are frequent, especially in officialese: 

(36) Para que cumpra-se o contrato, nao recusei-me as condigoes ‘In order to respect 
the contract, I did not object to the conditions’. 

It is also to be noted that in Portuguese the object pronoun is frequently alto¬ 
gether omitted in contexts where its reference is unambiguous: 

(37) Nunca vi ‘I’ve never seen (it)’; Nao gosta? Don’t you like it? 

In Spanish and Rumanian, too, there is normally no overt pronominal men¬ 
tion of objects that are non-deflnite. In most languages there is some tendency 
to omit direct object mention within a clitic array (see 6.5), but this is usually 
condemned by normative grammarians. 

(f) The change to proclisis 

In the other Romance languages the atonic pronoun, frequently used in 
second position, between the subject and the finite verb, over time came to 
attach itself proelitically to the verb, and the ban on it appearing in initial- 
of-clause position disappeared. Proclisis seems to have been fully estab¬ 
lished in French by the fourteenth century, and Occitan and Catalan seem 
to have followed shortly after. The first Italian grammar (1495) still 
describes the medieval enclitic type, but possibly mainly as a literary 
device. In Spanish proclisis was normal by the sixteenth century. The earli¬ 
est Rumanian texts of the sixteenth century use mainly tonic pronouns 
with usually postposed atonic pronouns, in ways that suggest Slavonic 
infiuence, but proclisis is attested sporadically in texts from the south¬ 
western Daco-Rumanian region of Banat from the seventeenth century. 

It is noteworthy that Greek also turned to proclisis rather than enclisis at about 
the same period as the Western Romance languages. 


(g) Why the change? 

The traditional explanation for the switch in French was that there was a 
change of sentence intonation and accentuation, in the late medieval 
period, from descending to ascending, favouring rightwise cliticization, 
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rather than leftwise. The hypothesis seems to square with the resistance to 
proelisis in positive imperative sentences (with assumed longer persistence 
of descending intonation), and the fact that sentence-initial atonic pronouns 
were first attested in early thirteenth-century texts in polar interrogative 
sentences (with assumed rising intonation). By the end of the thirteenth 
century, too, the old tonic object pronoun forms started to be used to 
replace disjunctive subject pronouns, while the atonic subject pronouns 
cliticized to the verb to become virtual inflections (see 1.2). It is assumed 
that it was around this time that word stress gave way to rhythm-group 
stress in French, so that the inherited tonic/atonic distinction no longer 
made phonological sense. What resulted was a wholesale reorganization of 
much of French morphology. 

Even if this explanation works for French (and perhaps for Brazilian 
Portuguese), it is less plausible for the other languages, where there is little 
other sign of change of sentence rhythms. Another suggestion that could 
account for parallel developments is that, from Proto-Romance on, the 
pronouns increasingly tended to cluster round the verb, and with the grow¬ 
ing rigidity of word order in the early modern period they became insepa¬ 
rable from their host. 

(h) Spread from French? 

Comparativists, I repeat, shrink from postulating that common features 
stem from convergence, rather than from inheritance. But the chronology 
of the geographical progress of proelisis (or at least of preposing) of 
object pronouns through literary Romance does seem to point to diffusion 
from a French innovating centre. Any such influence must have begun at 
the turn of the twelfth and thirteenth centuries and could be linked with 
linguistic and cultural interchange during the Crusades. A text in the 
lingua franca believed to originate in 1204 during the siege of Constan¬ 
tinople has only preposed object pronouns, and may reflect to some degree 
spoken usage. Portuguese, out on a limb at this period and impervious to 
French prestige until a later period, seems to have fought shy of the trend, 
though Rumanian, where strong French influence was felt from the eigh¬ 
teenth century, did not. However, the Rumanian feminine (and neuter) 
pronoun o, thought to be Slavonic in origin, though normally proclitic, is 
still used enclitically with the participle and the gerund: 

(38) O vdd / am vdzut-o la gard ‘I see / have seen her at the station’. 
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(i) Imperatives 

Each language has gone a different length along the road to proclisis. 
Modern French has gone furthest, but jibbing (unlike colloquial Brazilian 
Portuguese) at introducing proclisis into positive imperative construc¬ 
tions. Medieval French postposition of the pronoun to the imperative 
verb was regular only when the verb was in initial-of-clause position, but 
by the eighteenth century this ordering became characteristic of the 
imperative (though literary examples of older ordering can still be found 
even today). 

In the modern language it may not be entirely accurate to describe the 
pronouns in imperatives as clitics. In the negative imperative we have the 
usual atonic proclitics, but in the positive they can occur in rhythmic 
group-final position which requires stress: 

(39) Ne me le donnezpas! I Donnez-le-moi! ‘Don’t / give it to me!’; Ne le dispas!! 

Dis-Ie! ‘Don’t / say it!’ 

In the former example, conjunctive me is replaced by disjunctive moi, while 
in the latter the eonjunctive form is retained, but with stress on the sehwa 
that in other contexts can occur only in unstressed syllables. Some prefer 
to see it not as a schwa (e instable) but as a realization of the phoneme I 0 I. 


(j) Infinitives 


Other standard languages regularly postpose atonic object pronouns, not 
only to positive imperative verbs but also to non-finite verb forms - most 
generally to the infinitive. In Old French this was also a possibility, though 
more frequently a tonic pronoun was preposed to the infinitive, especially 
when the referent of the pronoun was human: 


(40) 


(41) 


pour garder le /pour lui garder ‘to keep him’. 

A preposed atonic person 3 pronoun would be indistinguishable in most cases 
from an article introducing a substantivized infinitive, though occasionally 
both article and pronoun are used in the same construction: 

.. .au metre le en tere 

{La mart le roi Artu, c. 1230) 
‘...on burying him [at the putting of him] in the ground’. 


By the fourteenth century use of the preposed tonic pronoun was unusual. 
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but examples are found until the late fifteenth century. In modern French 
the erstwhile atonic form is always proclitic to the infinitive: 

(42) pour le garder. 

In some other languages (for instance, in South-Western Occitan varieties) 
there are relies of a medieval distinction between preposed tonic and post- 
posed atonic form. 

In modern Sard, though the pronoun is often enclitic to the gerund, it is 
regularly preposed to the infinitive: 

(43) (Nuoro) li ndrrere 'to tell him’ / ndndelis ‘telling them’ 

though this was not so in older texts. The reluctance to use enclisis with the 
infinitive is probably connected with infinitival inflection in Sard. In 
Portuguese too pronoun preposing is obligatory with the personal infini¬ 
tive: 

(44) sem me verem ‘without them seeing me’. 

(k) Proclisis or proposing? 

One difficulty in making generalizations about the Romance forms is judg¬ 
ing how far they are indeed phonetically cliticized to their verb host. In 
Brazilian Portuguese, for instance, it is maintained that object pronouns 
retain a degree of individual stress and are not properly proclitic. 

For older stages of the languages it is often impossible to tell whether a 
pronoun is incorporated metrically to a preceding or following word. 
Romance preposing of the pronoun to the finite verb is usually equated 
with proclisis, but in the medieval languages the pronoun was often enclitic 
to a preceding word rather than proclitic to the following verb. 

More to the point, though, is the question of whether indeed the forms 
in question are ‘syntactic clitics’ - that is, reduced elements standing in for 
full forms that have gravitated towards the head of their phrase - or inflec¬ 
tional affixes that have become an integral part of the verb. It looks as if 
there has been a drift towards affixation in most of the languages, but that 
French has led the way, perhaps dragging others along with it. 
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4.5 PERIPHRASTIC ASPECTUAL FORMS 

In Latin what are known as the periphrastic tenses are those composed of 
the verb ‘to be’ and the future participle or the adjectival gerundive: 

(45) Amaturus/amandus est ‘He is about to love’ / ‘He has to be loved’. 

The former disappears altogether in Romance. The form with the gerun¬ 
dive (or perhaps the ablative of the substantival gerund) has to some 
extent survived, however, to signal progressive or continuous aspect, now 
with active rather than passive import. In most languages reflexes of esse 
have been replaced by other auxiliaries - most often stare ‘to stand’ or 
verbs of motion: 

(46) Sard Estploendo ‘It’s raining’, but Ital. Sta scrivendo, Span. Estd escribiendo 
‘He’s writing’, Old French Va croissant ‘It’s growing’. Span. Va disminuyendo 
‘It’s getting smaller’. 

Some believe that the Romance forms are grounded less in the Latin gerundi- 
val form than in an ancient Greek progressive periphrastic with a present par¬ 
ticiple, which is assumed to have penetrated into spoken Latin. In Romance the 
present participle (in -NT-) and the gerund(ive) (in -nd-) tend to merge. This 
process was completed in French (in -ant forms), while in Spanish and Italian 
the (possibly learned) present participle forms are lexicalized as adjectives. 

The gerund in Latin was a neuter verbal noun in suppletion with the 
infinitive, which could not be used in contexts requiring oblique case 
marking. In Romance the infinitive has widened its scope to the detriment 
of the gerund(ive) forms and it provides the base for newer periphrastic 
forms. In French, for instance, the older Va chantant ‘He is singing’ pro¬ 
gressive type was replaced in the seventeenth century by II est apres chanter 
‘He is after singing’, which survives in Canada, the creoles and some met¬ 
ropolitan dialects. Condemned as vulgar, this was in its turn replaced in 
the eighteenth century by II est en train de chanter ‘He is in the process of 
singing’. The more archaic Estd escrevendo ‘He’s writing’ or Ando procu- 
rando ‘I am getting’, still used in Brazil, are similarly replaced in European 
Portuguese and Galician by Estd a escrever, Ando a procurar. 

(a) Infinitival forms 

We have already seen how the infinitive has tended in modem Romance to 
oust certain finite clauses. In many cases it also combines with (semi-) aux- 
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iliaries to produce what are virtually new tenses. The go future, used to 
indicate the imminence of an event, for instance, today rivals the synthetic 
future in French, Spanish and Brazilian Portuguese: 

(47) II va chanter^ Va a cantor, Vai cantor ‘He’s going to sing’. 

These forms, which are not yet totally grammaticalized, probably date 
from the early modem period. 

Other Romance languages use only periphrastic future forms, with a 
variety of auxiliaries, but again these may be of fairly recent origin: 

(48) R-R (Surselvan) El vegn a cantor [he comes to sing]; Rum. Valalo cinta [he 
wants to sing]; Sard Deppo cantare [he ought to sing]; Ital. dial. Ha (a) cantd, 
European Ptg. Hd-de cantar [he has to sing] ‘He’ll sing’. 

In Catalan and Western Occitan varieties the go + infinitive form func¬ 
tions not as a future but as a preterite: 

(49) Cat. Vaig cantar [baf kan'ta] ‘I sang’. 

Ambiguity of time reference for similar forms is detectable in the medieval 
French and Occitan texts, but grammaticalization to refer to the past is 
unusual. It does, however, occur also in the isolated (Francoprovengal?) 
dialect of Guardia Piemontese in Calabria, in southern Italy, which seems 
to have been carried from north-western Alpine Italy with immigrants in 
the late Middle Ages. This suggests that the germs of the change may have 
once been more widespread. 

Other periphrastic forms with the infinitive that are more or less grammatical¬ 
ized include the French recent past with the semi-auxiliary venir de. 


(b) Drift? 

The variety of outcomes in Romance point to parallel development rather 
than common origin for many of the periphrastic forms. They are often 
used as an illustration of a drift towards analyticity which culminates in 
the creole verb systems. The opposite movement, illustrated by the history 
of the Romance synthetic future, lends support to the view that morpholo¬ 
gy and syntax interact in a cyclical manner, with synthetic forms withering 
away as they are replaced by syntactic combinations, which in their turn 
fossilize into new synthetic forms. 
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4.6 THE FUTURE 

One Romance form that has not ceased to intrigue linguists is the so-called 
‘Romance synthetic future’ and its partner, the conditional or ‘future in the 
past’. These forms provide ammunition for those who maintain that 
‘today’s morphology is yesterday’s syntax’ or that languages pass through 
a cycle which transforms analytic into synthetic forms, only then to replace 
them by new analytic forms. 


(a) A ‘Romance’ future? 

In fact, as we have seen, most Romance languages today use analytic 
future forms, most of which are attested from about the fifteenth or six¬ 
teenth century. The ‘Romance future’ is confined to one part of the 
Western Romance area, and even there is threatened by elimination by 
periphrastic forms. This ‘synthetic’ future is exemplified as follows: 

(50) French II chantera, Ital. Cantera, Span., Ptg. Cantara ‘He will sing’. 

The standard story about the Romance future is that there developed in 
Late Latin a periphrastic habere + infinitive form to replace the Latin 
future, which was vulnerable because of its inconsistency across the conju¬ 
gations (amabo, monebo / AUDiAM, regam) and because its most salient 
marker, the intervocalic B of two of the conjugations, was threatened by 
regular sound change, merging with the u of regular perfect forms. 

Evidence for use of habere + infinitive forms functioning as a future 
tense is extremely doubtful in the Late Latin texts, which in any case make 
ample use of the Classical future forms. Such analytic forms were more 
often attested with imperfect than with present forms of habere, and 
more plausibly carried obligative meaning (‘have to’) than future tense ref¬ 
erence. 


One seventh-century attestation, from Fredegar’s Chronicles, is always cited. 
Here an apocryphal explanation for the place-name Daras occurs in an anec¬ 
dote; to an interlocutor who protests Non dabo ‘I shall not give’, the Emperor 
Justinian replies Daras. Given the jokey context, in a manuscript which dates 
from a rather later period, the example is dubious. But even if genuine it is 
more plausibly read as ‘You have to give’ rather than as a neutral future. 

The next attested examples, in what is usually described as the first 
Romance text (the Strasburg Oaths, usually dated 842, but in an eleventh- 
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century manuscript), salvarai and prindrai, can likewise be read as ‘I have 
to save / take’. An obligative meaning can also be read into many of the 
medieval, and even some of the modern, attestations of the forms. 

The relative positioning of infinitive and auxiliary, with stress on the lat¬ 
ter, also raises questions. In most of the Western languages auxiliaries reg¬ 
ularly precede the lexical verb, yet here a postposed auxiliary seems to have 
been agglutinated as an affix. This, for some commentators, proves that 
grammaticalization must have occurred early, in a Latin that still retained 
verb-final ordering. 

In this perspective the fixed postverbal position could lead later to 
reanalysis of the auxiliary as an affix, as it did not fit into the usual pattern 
of auxiliary ordering. This story does not square well with the possibility, 
in Old Spanish, of the future auxiliary being preposed to the verb, nor, in 
Old Portuguese, of a single auxiliary being used with two infinitival forms. 

More impressive evidence that there was no such reanalysis at an early 
period is provided by the intercalation of object pronouns between the 
infinitive and the postposed auxiliary (tmesis). This suggests that the auxil¬ 
iary was separable from the infinitive and not affixal (for object clitics with 
the infinitive see above). Such tmesis is occasionally attested in medieval 
Occitan, Catalan and Northern Italian, and in Spanish until the seven¬ 
teenth century, but never in French. It remains obligatory for affirmative 
sentences in literary Portuguese to the present day: 

(51) Responder-nos-d ‘He will reply to us’. 

In modern spoken Portuguese, which makes little use of the synthetic future, 
this rule does not hold, nor did it hold absolutely in Old Portuguese or Old 
Spanish. 

It may be that in the medieval languages there were two constructions: a 
periphrastic auxiliary + infinitive which allowed tmesis, used when the 
infinitive began a clause, probably emphasizing its lexical content, along¬ 
side a synthetie future form. In the latter the lexical part of the verb tended 
to lose its identity as an infinitival form and to develop as a future stem, 
with an r marking futurity. In French the future stem early divorced from 
the infinitive (compare venir ‘to come’ with viendra ‘will eome’). In Spanish 
and especially in Portuguese, however, the distinctive future stems were 
rejected by language arbiters at the time of standardization, so that, for 
instance. Old Portuguese salrei ‘I shall go out’ was replaced by sairei 
(infinitive sair). In the modern language there are very few ‘irregular’ 
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futures (like fareilfazer ‘to do’). In Spanish the reform was not so thor¬ 
oughgoing; frequently used forms retained their distinctive shape: 

(52) habra ‘he will have’(01d avrd) I moverd ‘he will move’ (Old movrd). 

(b) The French connection? 

It has been plausibly suggested that the synthetic future was originally a 
Northern French innovation. The fusion of the two elements was perhaps 
connected with early radical sound shifts in that region, which meant that 
the unaccented stem became clearly differentiated from infinitive forms. 
During the Middle Ages French usage may then have influenced the devel¬ 
opment, in prestige varieties elsewhere, of a Proto-Romance inherited 
periphrastic form. Some languages, like Rumanian and Sard, would clear¬ 
ly have been beyond the reach of such influence at that period, while 
others, like Portuguese, resisted it. It has survived, sometimes marginally, 
in standard languages but it is rivalled everywhere in popular colloquial 
use by newer periphrastic forms. 

4.7 THE PASSIVE 

The Latin passive in the infectum forms, an innovation compared with 
most Indo-European languages, was formed with an R that was possibly 
originally a nominalizing affix: 

(53) AMOR, AMABAR, AMABOR ‘I am / was / will be being loved’. 

These forms are completely lost in Romance, though amply attested in 
Late Latin texts. 

Most Romance languages do, however, continue the compound be + pas¬ 
sive participle of the Latin perfectum series. The form with the present tense 
of BE reflects more the present perfect than the aorist, Latin meaning, and it 
is usually indistinguishable from the stative copula + participial adjective: 

(54) Amatus est: French II est aime, Ital. E ainato ‘He is loved’ (rather than ‘He is 
being loved’) 

except on those rare (and often dubious) occasions when the agent is 
expressed: 


( 55 ) 


French La maison est construite {par tes magons), Ital. La casa e costruita {dai 
muratori) ‘The house is built / is being built (by the bricklayers)’. 
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(a) The be passive 

The B£-passive, like the //^Kij-perfect, was probably not fully grammatical- 
ized in the medieval languages, and retained much more the character of 
an adjectival construction. In a few cases in the modern languages, there is 
a tendency to distinguish the participial form in the two senses, with the 
inherited ‘strong’ form used adjectivally, though there is much regional 
and individual variation: 

(56) Ital. E veduto ‘he is (being) seen’/ E hen visto ‘he is well regarded’. 

In Spanish and Portuguese the distinction is made more clearly by use of 
different be verbs {ser < sedere ‘to sit’ replacing some of the forms of 
*ESSERE, for ESSE / estar < STARE ‘to stand’; see 7.5): 

(57) Span. Es despertado ‘He is being awoken’/ Estd despierto ‘He is awake’; Ptg. Foi 
corrompido ‘He was corrupted’ / Estava corrupto ‘He was corrupt’. 

(b) Other auxiliaries 

The stative connotations of the be passive can in some of the languages 
be surmounted by the use of other auxiliaries - especially verbs of move¬ 
ment - which convey a more dynamic aspect: 

(58) Ital. Vengo lodato. Cat. Vaig that ‘I am being (come/go) praised’. 

In the Swiss Rhaeto-Romance languages the normal passive auxiliary is 
the verb ‘to come’: 

(59) Engadine E{a)u vegn ludd, Surselvan Jeu vegnel luddus. 

(c) The SE passive 

In all the languages the RE-passive is frowned on by purists, though in cer¬ 
tain types of discourse (like journalistic or officialese) it is fairly frequent. 
More usually a middle, erstwhile reflexive, pronominal form is preferred 
when the agent is unexpressed: 

(60) French La maison se construit; Ital. Si costruisce la casa-. Span. Se construye la 
casa ‘The house is being built [is building itself]’. 

Apparent examples with an expressed agent, although found in earlier 
stages of the languages, are now usually disputed. 

This SE passive was already adumbrated in Latin and is thought by some 
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to be the true Proto-Romance substitute for the Latin r forms, which were 
themselves originally more middle than passive. All the Romance languages 
have a se passive, though it is less widely used in French than elsewhere. Its 
adoption in French by about the thirteenth century may be a consequence 
of direct influence from other Romance languages via Crusaders, who 
interacted through a pidgin-like lingua franca (‘Frankish language’) which 
drew elements from all the Romance languages, but especially from Italian. 

In Rumanian, where the be passive seems to be a fairly recent and book¬ 
ish introduction, imitated from the Western languages: 

(61) Pachetu! a fast trimis de el ‘The parcel was sent by him’ 
the SE passive is much preferred in colloquial use: 

(62) S’a trimis pachetul ‘The parcel has been sent’. 

The extensive use of the pronominal in Rumanian is often ascribed to 
Slavonic influence. 

The further development, from ‘middle’ uses, of an ‘impersonal pronominal’ in 
some of the languages, probably as late as the sixteenth century, is unknown in 
French (see 6.5). 


(d) Convergence or interinfluence? 

Here again it is difficult to disentangle the threads of development of pas¬ 
sive constructions. Do they all continue common Proto-Romance uses? 
Have they drifted in the same direction from a common base? Or has 
there been interinfluence during the course of history of each of the lan¬ 
guages? Probably all three explanations are valid to some extent. The be 
passive is most clearly derived from Latin, and so in certain types of dis¬ 
course has carried some prestige, but in modern times it is often seen as 
clumsy and ambiguous. Consequently other strategies are favoured, prob¬ 
ably developing in parallel, but helped on by interaction between the lan¬ 
guages. 


4.8 LEXICON 

Whereas Romanists, as good comparativists, usually avoid seeking in con¬ 
vergence or contact the origins of common phonological, morphological 
or syntactic features, the same is not true for the lexicon, where, it is 
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acknowledged, ‘anything goes’. Loanword studies have always been a 
happy playground for the traditionalist, though dismissed as amateurish 
by more rigorous linguists. Purist normative grammarians meanwhile con¬ 
demn or lament in vain the practice of borrowing and calquing foreign or 
regional words. True, loanwords are rarely found among the very basic 
vocabulary items, but at least 30^0 per cent (depending on the breadth of 
the lexicon under consideration) of the normally used vocabulary of any 
Romance language has originated from non-native sources. 

(a) Interchange between the languages 

There is no denying that at the lexical level interchange between the 
Romance languages is of significant proportions, amounting to about 
10 per cent of items in modem everyday usage. At the present time, and 
especially since the Second World War, the languages turn more to English 
for new vocabulary than to each other - much to the disgust of purists and 
nationalist politicians, especially in Francophone communities. 

The extent to which each language is a lender or borrower has changed 
over time, and fashions rapidly fade, but it is probably true that French has 
been a perennial source of long-lasting loanwords for all the languages. 
During the Middle Ages the interchange was two-way between Italian and 
French, though in the thirteenth century, Italian was very much more the 
borrower. Here are some examples. 

Interchange of vocabulary between Italian and French in the Middle Ages 
Examples of Italian borrowing into French: alarme, espion, magasin 
Of French into Italian: abbandonare, giardino, goiello, leggero, mangiare, 
omaggio, profit to, romanzo, sentiero, troppo, villaggio 

The apogee of Italian prestige came at the time of the Renaissance with 
floods of especially cultural and military words entering other languages (not 
only Romance) in the fifteenth and sixteenth centuries. Below are some exam¬ 
ples of French borrowings which also penetrated often into other languages: 

Borrowing from Italian into French in the fifteenth and sixteenth centuries: 
accoster, artisan, assassin, balcon, ballon, banque, bizarre, brave, briller, carton, 
campagne, cavalier, charlatan, esquisse, manquer, masque, numero, parfum, 
risque, saucisson, valise 

In Spanish and Portuguese there was also some preferenee for Italian 
loanwords in the nineteenth and early twentieth centuries, when Ruma¬ 
nian also drew from that source. 
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Leaving aside Rumanian, which was deluged by Gallicisms after it 
attained national language status: 

(63) cabinet ‘office’, plajd ‘beach’, menajd ‘household’ 

the peak of borrowing from French was in the seventeenth and eighteenth 
centuries. Spanish resisted this trend during its Golden Age, but suc¬ 
cumbed again in the eighteenth century: 

(64) botella,frambuesa < bouteille, framboise ‘bottle’, ‘raspberry’. 

Portuguese, turning its back on Spanish from the seventeenth century on, 
has welcomed French loanwords in the modern period: 

(65) duche, greve < douche, greve ‘shower (bath)’, ‘(labour) strike’. 

Until quite recently French was the favourite foreign language of most 
other Romance speakers: in Latin America, especially, this was to some 
extent a reaction against the gringo, but more generally it was a tribute to 
the rich French literary heritage. Today, however, French is losing its grip, 
even in Italy, Portugal and Rumania, the most loyal European admirers of 
French culture and diplomacy. 

It was mainly through Spanish that Arabic and American words pene¬ 
trated into the other Romance languages, usually accompanying new prod¬ 
ucts or ideas (although a subsidiary source of Arabisms was via Sicily): 

(66) naranja ‘orange’, azucar ‘sugar’ > French orange, sucre, Ital. arancia, zucchero; 
chocolate ‘chocolate’, tabaco ‘tobacco’ > French chocolat, tabac, Ital. cioccola- 
ta, tobacco. 

The greatest influx of Spanish loanwords into all the languages occurred 
during the Golden Age, when Catalan was particularly subject to its influ¬ 
ence, after having itself contributed substantially to Spanish vocabulary in 
the fifteenth century. Catalan, and then Spanish, invaded Sard vocabulary 
during its Aragonese period (1326-1718): 

(67) barat, barato ‘cheap’ > baratu. Cat. treballar ‘to work’ > travallare. Span. 
dichoso ‘lucky’ > ditsosu. 

Catalan and Occitan otherwise had little impact on the other languages 
after the medieval period, when their literary prestige was supreme. 

Portuguese was the channel for exotic words from Africa and the East to 
enter the other languages during the colonial period: 
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(68) bambu ‘bamboo’, acaju ‘mahogany, cashew’ > French bambou, acajou, Ital. 

bambu. Span, bambu 

but otherwise it seems to have been a net borrower rather than a lender. 
Admittedly its inherited vocabulary is so close to that of Spanish that it is 
not easy to distinguish it from borrowed vocabulary, but Span, afeitar ‘to 
shave’ and enfadoso ‘irksome’ are clearly of Portuguese origin. It appears, 
however, that Portuguese loanwords often entered other languages via 
French, rather than direct: one example is the word creole. 

(b) Borrowing from non-standard varieties 

Until quite recently many, perhaps most, Romance speakers operated in at 
least two Romance varieties. With standardization, medieval Latin- 
Romance diglossia gave way to standard-dialect diglossia, which has only 
slowly been eroded by universal education. It is inevitable then that there 
has been seepage from one variety to the other. At some periods use of 
dialectal words in the prestige language was fashionable: this was particu¬ 
larly so in French during the sixteenth century. Some revival of this fash¬ 
ion came about in the late nineteenth century as more interest was shown 
in folk culture and literature began to take on a more populist tone. 

But the transfer of words was made more in the opposite direction, with 
the standard variety tightening its grip on local dialects. However, as the 
dialects lose their autonomy, regional versions of the standard grow up, 
which typically incorporate some salient features of the dying dialects - 
this is particularly true of distinctive vocabulary, so that in Italy or Latin 
America, for instance, standard speakers in different regions have a whole 
range of different words for everyday objects and activities, each borrowed 
originally from their local dialects. In a similar way, words that were once 
kept underground, as popular or vulgar, begin to emerge into prominence 
in standard usage, as purist stances relax and literary style tolerates more 
social varieties. This is true particularly of French today, and the same ten¬ 
dency is discernible in European Spanish and Portuguese, as it long has 
been in Latin America. 

4.9 CONCLUSION 

What I have called the ‘club’ effect in Romance manifests itself not only by 
the common plundering of Latin sources that we described in the last 
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chapter, but also in symbiotic interaction between the languages during 
their history. Each of the languages, except perhaps Rumanian and Sard, 
has had something to contribute to the common pool, but relative cultur¬ 
al and political prestige of any language at any time is correlated with the 
linguistic sway it holds on the others. It is in lexical borrowing that influ¬ 
ence is most transparently exerted, but morphosyntactical interchange 
may also occur in situations, like the Crusades, where there is frequent 
interaction between the speakers of the diverse languages. 

More difficult to assess are the parallel developments that have appar¬ 
ently occurred in the different languages, but at different rates and to 
different degrees. In so far as these represent quasi-universal tendencies, 
triggered by functional considerations, there is no puzzle; the replacement 
of synthetic by analytic forms, for instance, is widespread in all the Indo- 
European languages. But some of the Romance developments seem to be 
idiosyncratic - like the introduction of proclitic object pronouns, or the 
creation of a new synthetic future tense. Do we seek the origins of these 
features in a Latin variety for which we have little textual evidence? Or do 
we accept that each language is tracking at different speeds along tram¬ 
lines that lead in the same direction from the same starting point? We 
wonder too whether the creoles represent a further staging point in this 
journey, or some sort of side-track. 

So far we have looked at some of the many shared features of the 
Romance languages. But each language also has its own characteristics. 
Part II looks at the differences between the languages. In the next chapter 
we shall ask what delimits one language from another, and attempt to clas¬ 
sify the languages into different groupings, and then we shall go on to dis¬ 
cuss how the languages became different. 

FURTHER READING 

On diphthongization and on vocabulary interchange, see especially the articles 
by Purschinsky and Hope in Posner & Green eds. (1980-93), vol. i. On the 
infinitive, Joseph (1985); on infinitival complementation, Lamiroy (1984), 
Pearce (1990). On subordination, Dardel (1983). Calboli ed. (1989), Andersen 
& Skytte eds. (1995). On clitics, Nevis et at. (1994), Borer (1984, ed. 1986), 
Jaeggli (1981), Kok (1985), Wanner (1987), Jaeggli & Safir eds. (1989). On aux¬ 
iliaries, Harris & Ramat eds. (1987). On the future, Fleischman (1982). On 
reflexivization, Everaert (1986). On ‘middles’, Fagan (1992). On the passive, 
Klaiman (1991). On noun phrase syntax, Giorgio & Longobardi (1991). 
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How many Romance languages? 


5.1 HOW MANY ROMANCE LANGUAGES ARE 

THERE? 

An answer to this question that has been slightingly labelled sancta sim- 
plicitas is that there is only one: the languages are all alike enough to be 
deemed dialects of the same language. Another equally disingenuous 
answer might be ‘thousands’ - of distinctive local varieties - or ‘millions’ - 
of individual idiolects. The usual textbook answer is ‘ten, or possibly 
eleven’, according priority to putative chronologically early differentiation 
from the common stock, allegedly linked to ethnic differences among the 
speakers. 

(a) State languages 

The layman has no difficulty in accepting as ‘languages’ the five that have 
‘armies and navies’ (some of them several) - that is French, Spanish, 
Portuguese, Italian and Rumanian. Even then there is in some Latin 
American countries a reluctance to accept the names that refer to 
European states: ‘Castilian’ or ‘national language’ is preferred to Spanish, 
and in Brazil terms like ‘Vernacular’ or ‘Expression and Communication’ 
are sometimes used in educational institutions to avoid reference to 
Portuguese. 

For French there appears to be less of a problem about nomenclature, 
but Suisse Romande or Belgian speakers sometimes resent the hegemony 
of Parisian French. Canadian French speakers are more immediately con¬ 
cerned with the challenge of English than with that of metropolitan 
(‘hexagonal’) French. 

Rumanian is also associated with two separate states - in the ex-Soviet 
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republic of Moldavia (Moldova) the majority of inhabitants use 
Rumanian, until recently written in Cyrillic script, and dispute continues 
to rage about separate language status. 

Italian, on the other hand, is the language of a single state (leaving aside 
San Marino and the Swiss Tessin (Ticino) canton) but it overlays a host of 
different still lively dialects, with sometimes comparatively little in com¬ 
mon. In origin it was basically a medieval literary language adopted by the 
Italian state unified in the nineteenth century, but it has yet to win over the 
hearts and mouths of all Italians. 

(b) Literary languages 

Another two Romance ‘languages’ gain general recognition mainly 
because of their status as medieval literary languages, coupled with the 
militancy of linguistic propagandists. These are Occitan and Catalan. 

Occitan is a modern name adapted from the older langue d’oc of south¬ 
ern France, distinguished from northern langue d’oil (oui) on the basis of 
medieval words for ‘yes’. In the nineteenth century it came to be known as 
Provencal, mainly to German linguists and partly because of the mid¬ 
century literary revival, whose main luminary was Mistral, which favoured 
the local varieties of Provence. Occitan is the name preferred by modern 
activists, whose local loyalties usually look further to the west. A standard¬ 
ized non-regional form is recognized in a limited way for French educa¬ 
tional purposes, though modern spoken varieties can differ considerably, 
especially in Gascony. 

One Bearnese variety of Occitan, spoken by the majority of inhabitants 
in the Val d’Aran, in the central Pyrenees, has official status, because the 
valley falls politically within the Generalitat de Catalunya (the regional 
government of Catalonia), which has a liberal language policy. The 
dialects of the extreme south-east coastal area of France, around Nice and 
Monaco, are closer linguistically to Italian Ligurian varieties than to 
Occitan: the zone was not drawn completely into the French political orbit 
until the mid-nineteenth century. 

Old Occitan literature, especially the lyric poetry of the troubadours, is 
acknowledged as among the finest in Europe, before it was effectively sti¬ 
fled by the northern forces of the Albigensian crusade in the thirteenth 
century. The rediscovery of the literature during the Romantic revival was 
partly responsible for the growth of interest in Romance languages: 
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Raynouard (see page 113) virtually identified the ‘language of the trouba¬ 
dours’ with the postulated common langue romane from which all 
Romance languages have sprung, and a youthful Diez (see pages 3-4), on a 
pilgrimage to Weimar, was directed towards Romance studies by Goethe, 
who recommended troubadour poetry to him. 

Literary Old Catalan was very close to Old Occitan, culturally and lin¬ 
guistically, and sometimes called by the same name lemosi (Limousine - 
from the Limoges region). It originated in the north-eastern buffer region of 
Catalonia carved out by Charlemagne from Moorish Spain, possibly as an 
innovating variety of Aragonese. Its prestige was great and it remained the 
chancery language of the Aragonese court until the union with Castile in the 
fifteenth century. The nineteenth-century revival of the language was aided 
by the prosperity of its main urban centre, Barcelona, and in spite of repres¬ 
sion by the Franco regime (although even then it was the official language of 
independent Andorra) it has survived as a vigorously championed language, 
even achieving some degree of recognition within the European Union. 
Varieties of Catalan are used north of the Pyrenees in French Roussillon, in 
the Balearics and even, still actively, in Alghero, a fishing port in north¬ 
western Sardinia. There it survived from the time the town was resettled 
under the Aragonese occupation in the fourteenth century, almost unno¬ 
ticed until its rediscovery by nineteenth-century Catalan archaeologists 
researching in Sardinian prehistoric sites. Algherian has since been drawn 
firmly into the Catalan orbit, with the keen support of the handful of now 
prosperous Catalan speakers in this attractive tourist resort. 

Less enthusiasm for Catalan is shown in Valencia, where a Catalan vari¬ 
ety was, until recently, spoken mainly by the less-privileged inhabitants, 
and where now the tone-setting intellectuals and politicians maintain that 
Valencian has separate language status, though objectively it differs little 
from other Catalan varieties. 

On the basis of an illustrious medieval literary tradition, Galician (Gallego), 
the Iberian language of choice for lyric poetry, should be counted as an addi¬ 
tional Romance language. However, the textbook stance is that it is the same 
language as Portuguese - or rather Galego-Portuguese, associated with north¬ 
western pre-Romance Celtic population and post-Romance settlement by 
Swabians (rather than Visigoths, who occupied most of the rest of the peninsu¬ 
la). The language spread south with the Reconquest but split into two branch¬ 
es when Portugal became a separate state, leaving Galicia as part of Spain. 

(For standardization of Galician see 5.5.) 
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(c) Linguists’ languages 

Two of the textbook Romance languages owe their status to the whims of 
a late nineteenth-century linguist, the Italian Graziado Ascoli (see page 4), 
voicing contemporary ideas about ethnicity and language. It was he who 
first mooted the idea of Rhaeto-Romance as originally a unified language, 
linked closer to French than to the North Italian dialects with which it is 
now intertwined. The name recalls the Rhaetians, who lived in the Alpine 
region in ancient times; the assumption was that their peculiar version of 
Latin survives today in the scattered Rhaeto-Romance dialects, which have 
been depleted by incursions by the Italian Lombard and Venetan varieties 
from the south, and by German from the north. 

The view that they are really centrifugal varieties of North Italian 
dialects, whose prolonged contact with German sheltered them from inno¬ 
vations, is held particularly by Italian linguists, and was given official sup¬ 
port under Mussolini. The Italian claim for the South Tyrol from Austria in 
the First World War was bolstered by the linguistic case. There, in the Alto 
Adige, amid a German-speaking population, the valleys of the Dolomites 
harbour a small number of speakers of idiosyncratic Romance varieties, 
known collectively to linguists as Ladin, and now officially recognized as a 
separate language. There were no written forms of these varieties before the 
nineteenth century. The Ladin speakers’ loyalties went in the past to 
Austria rather than to nascent and impoverished Italy, and in the Second 
World War, following a pact between Hitler and Mussolini, most of them 
chose to throw in their lot with Greater Germany. The region, however, 
remained Italian after the war, a period of terrorism by German-speaking 
separatists being eventually ended by granting the region a degree of 
autonomy. This spectacularly beautiful region, particularly popular with 
German-speaking mountain-lovers, could then settle down to the lucrative 
tourist trade. Ladin is now taught in local primary schools, in a different 
version for each valley, and there seems to be no danger of its early demise, 
nor any desire for standardization and unification. 

The Rhaeto-Romance varieties of the Graubunden (Grisons) canton of 
Switzerland had a rather different history. Mussolini’s political threat per¬ 
suaded the Confederation to grant to the dialects, in 1936, the status of 
‘national’, but not of ‘official’, language; that means that it can be used for 
official purposes within the canton, but not elsewhere within the Confe¬ 
deration (unfortunate, today, when many speakers live in Zurich or 
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Geneva). Each commune, originally 120 in number, could decide whether 
its primary school uses the Romance variety; over the years more and 
more have moved to German, or sometimes to Italian. In those eighty-two 
communes that do adopt it, five different regional varieties are used. The 
‘school-taught’ languages are based more on geographical regions than on 
linguistic affinities. 

The most westerly variety, in the Upper Rhine valley, with the most numer¬ 
ous group of speakers, is known as Romansh (Romontsch, roumanche). 
Linguists call their language Sursilvan or Surselvan (Oberwaldisch). It was 
previously called Churwdlsch, from the administrative centre Chur (Cuera) 
whose population in the fifteenth century was wholly (now only 10 per cent) 
Romance-speaking. The dialect area has shrunk over the last few hundred 
years as German spread its tentacles, and the Surselvan dialect is clearly 
heavily influenced by German. Early texts date from the sixteenth century - 
mainly Bible translations and religious commentaries. The area was riven 
asunder by the Reformation, and different orthographical systems were 
devised for Catholics and for Protestants. A smaller, and shrinking, dialect 
grouping Sutselvan (Nidwald) is quite closely related to Surselvan, which was 
until quite recently often used as a school language in the region. The 
Surmeiran (Ober- and Unterhalbstein) grouping embraces heterogeneous 
varieties, some of which are quite remote from the school-taught language. In 
all these regions bilingualism with Swiss German is general. 

Further easterly varieties, just south of western Austria, have long been 
known to the mountaineer and skier. In the Inn valley, the Engadine is 
split between Upper and Lower dialects {Ladin puter and vallader), while 
the Val Miistair (Munster) version is often classed separately. These vari¬ 
eties are closer to Italian dialects south across the mountain passes than 
are the Western varieties. Influx of German-speaking population has in 
the last half-century reduced radically the proportion of Romance speak¬ 
ers, who often number among the less prosperous of their communities. 

Recently a new unified standard form embracing all the varieties has 
been elaborated (see 5.5) in an attempt to save a language which the Swiss 
claim to be proud of, but do little to sustain. 

Friulian (or Friulan), a third group of dialects associated by Ascoli with 
the Rhaetians, is much closer to Italian than the others, as West Friuli has 
been closely associated with Venice since the sixteenth century. East Friuli, 
though, had, until after the First World War, closer contact with German 
(not to mention Slovenian to the east). Friulian has had a flourishing liter- 
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ature from the sixteenth century and communication between the different 
varieties is relatively easy. It is not recognized as a separate language by the 
Italian state, and most speakers operate freely with the Veneto koine as 
well as in Friulian and Italian. Friulians, like Galicians, have in the past 
emigrated to other parts of the world, often retaining their distinctive 
speech habits. There is even in the Danube delta region of Rumania an 
active community of Friulian speakers, implanted there in the nineteenth 
century. The link between Friulian and the other Rhaeto-Romance vari¬ 
eties is slender and many linguists deny any connection, pointing to the 
features it shares with surrounding dialects of Veneto. 

The other Romance language that Ascoli wished on us is Franco- 
Provengal. Today it is recognized by most linguists as a distinctive set of 
South-Eastern French dialects, bordering on Occitan, and the compromise 
name ‘Francoprovengal’ is used. Some would prefer to see the dialects as 
basically Occitan into which French has made earlier and even greater 
inroads than further west; certainly the vocabulary is close to that of 
Occitan, but that may reflect the archaic character of the dialects rather 
than genetic affinity. Others point to the similarities between Rhaeto- 
Romance and Francoprovengal, postulating a common Alpine ancestor 
that had French-like features. Yet others relate Francoprovengal features 
to those of other ‘outpost’ French varieties in the north and east, which, it 
is assumed, retain more archaic characteristics. In France the dialects, 
which once embraced Lyon, Grenoble and the Jura, have all but disap¬ 
peared, but they have endured better in Switzerland and Italy. They are 
today spoken mainly in mountainous regions in eastern France (Savoy) 
and western Switzerland (Vaud, Neuchatel, Valais) as well as over the Alps 
in the Val d’Aosta (valdotain). Wartburg associated them with Burgundian 
settlement, though it has since been shown that this was not coterminous 
with the Francoprovengal area. Although the language of some medieval 
texts shows signs of provenance from this area, the culture language is 
French (though Mussolini temporarily imposed Italian on his side of the 
Alps). There has never been a koineized version, nor any substantial liter¬ 
ary output. For the relationship of these varieties with German see 8.6. 

(d) One-off 

The inclusion of Sard as a separate Romance language is agreed by most 
Romanists, on the ground that the island was cut off early from the rest of 
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the Romance area. How Romanized the island was is doubtful: excava¬ 
tions in the interior to find traces of ancient roads may cast light on that 
question. Fever-ridden (until American armed forces sprayed the mosqui¬ 
to swamps at the end of the Second World War), it was used as a peniten¬ 
tiary for hardened criminals by the Romans, and it was part of the 
Byzantine Empire and then under Saracen rule. In the ninth century inde¬ 
pendent townships were established, under the authority of Pisa, before it 
succumbed to the Aragonese. Yielded to the Austrians in 1714, it was soon 
handed over to Savoy in exchange for Sicily. It thus was fully integrated 
into Italy at the time of Unification. 

The dialects spoken within the island are not all, however, classified as 
Sard. The Northern dialects, of the western Sassari area or eastern 
Gallurian, lack the Sard distinctive features, like the iPSE-derived definite 
article (see 3.3). It is thought that they are in fact Corsican in origin, 
brought by refugees from vendettas in that island (for Corsican, usually 
classed as a Tuscan dialect, see 5.6). 

It is the central mountainous and, even today, relatively inaccessible 
part of the island which is supposed to harbour the prototypical Sard 
(Logudorian), while the Southern Campidanian group of dialects are seen 
as more heavily influenced by Italian. Although Sard is attested in charters 
from the late eleventh century, it never was standardized, and intercommuni¬ 
cation between the dialects is minimal. In the nowadays semi-autonomous 
region, the authorities encourage development of the language, but much 
of the island has remained poor, depopulated and bandit-ridden, with 
high emigration rates, so that there has been little popular pressure for 
standardization. 

(e) A thoroughly dead language 

Another language that sometimes rounds off the textbook score is 
Dalmatian, now totally extinct. It is seen as bridging the gap between 
Italian and Rumanian, and representative of a whole host of lost 
Romance languages that must once have existed. We know that a Romance 
language was once spoken along the Dalmatian coast, and that Ragusa 
(now Dubrovnik) still used the language in governmental functions in the 
sixteenth century, before Croatian took over completely. It is difficult to 
tell how different the language was from that of Venice, which controlled 
ports along the coast for centuries. The only evidence we have of a living 
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version of the language is that collected from the last surviving speaker, 
who was killed in 1898 by a landmine in the island of Veglia (Krk), off the 
Istrian peninsula. He seems to have been a garrulous, but not necessarily 
reliable, witness. He had never been an active communicator in the lan¬ 
guage, having picked it up from private conversations between his parents, 
who used it as a secret code. His own toothless condition, at the time of 
the investigation, may have played havoc with his articulation, and the 
transcriptions are in any case probably rather hit or miss. 

The fact that the informant had worked alongside Friulians (whose con¬ 
versational exchanges he said he could follow easily), and had frequent 
contact with Venetian, may render his evidence a little suspect. 
Nevertheless the information we can glean does suggest that the variety he 
used differs in some respects from other Romance varieties. In the Istrian 
peninsula nearby, some other tiny pockets of Romance remain - some of 
these are labelled Tstriot’ by investigators who see them as another distinct 
language, but others claim they are versions of North-Eastern Italian 
dialects, whereas Ascoli linked some of them with Rhaeto-Romance. 
There are also isolated localities using Tstro-Rumanian’, which appears to 
be more closely related to the Daco-Rumanian of the standard language 
than to other varieties, but infiltrated through and through by Serbo- 
Croat. Their origin is lost in the mists of time, but the communities are 
thought to have fled the Ottomans at an early period, and they have no 
folk-memory of Rumanian culture. 

(f) More languages? 

Other Romance varieties lay disputed claim to ‘language’ status: some of 
these will be discussed under Standardology (5.5) and others under 
Language and Dialect (5.6). 

5.2 GENETIC CLASSIFICATION 

Although there is comparatively little doubt about the membership of the 
Romance family, there is considerable disagreement on sub-grouping with¬ 
in the family. Classically, sub-grouping is based on historical and geo¬ 
graphical criteria. Assuming a past-time unified language, the Stammbaum 
(family tree) model looks for successive innovations, shared by more than 
one daughter language. Each diagnostic innovation creates a new unified 
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proto-language differentiated from its siblings, which then may be further 
split by later innovations among some of its members. As each sub-group 
is envisaged as a real-life language it must occupy space (geographical or 
social) as well as time. Causes of the innovations are usually sought in eth¬ 
nic or social factors, but breakdown of communications between far-flung 
areas also takes pride of place. 

The earliest Romance split postulated by traditional sub-grouping is 
that involving the loss of Latin vowel quantity (see 3.3). The merger of I/E 
and u/o is seen as an innovation that affected ‘Western Romance’ (or 
‘Vulgar Latin’, see 3.2), eluding those regions that had become isolated 
from trendy innovations at earlier periods - like Sard, Rumanian and 
Southern Italian varieties. 

Within the Stammbaum model usually in Romance only phonological 
criteria for diagnostic shared innovations are considered valid. George 
Trager in 1934 also introduced, as a check, some morphosyntactic criteria 
like the use of the two-case system in Gallo-Roman (French and Occitan). 
Nevertheless he grouped case-free Catalan more closely with Occitan than 
with French. On the basis of non-palatalization of Latin /k/ before front 
vowels, Dalmatian is grouped with Sard (as ‘insular’ Eastern Romance, as 
distinct from the ‘peninsular’ variety represented by Italian and 
Rumanian). 

The now most widely accepted version of the Romance Stammbaum is 
that of another American, Robert Hall Jr, who, having split off the isolates 
(first Proto-Sardinian, then Proto-Eastern Romance) from ‘Proto-Italo- 
Western Romance’ using vocalic criteria, then proceeds to use only criteria 
of consonantal changes to effect further sub-grouping. Proto-Italian is 
distinguished from Proto-Western on the basis of the treatment of clusters 
like /kt/ (assimilated in Italian but palatalized in Proto-Ibero- and Proto- 
Gallo-Romance; see 3.3). Then Pro to-Western Romance is sub-divided 
further mainly by reference to palatalizing innovations. The end-result 
looks suspiciously like areal grouping. 

Lexicochronology adopts similar assumptions to traditional 
Stammbaum theory but uses the criterion of shared basic vocabulary for 
sub-grouping. The most recent revisions of the method point to subgroup¬ 
ing on areal lines, with Rumanian and French at the poles, Portuguese and 
Spanish very closely related, and Catalan more loosely associated with the 
same group, while Friulian is more closely bound with Italian than with 
the other Rhaeto-Romance languages. Lexicochronologists are less con- 
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cerned to establish a Romance classification than to justify the validity of 
their methodology: therefore they are pleased that their sub-grouping, and 
the associated time-depth, closely resembles traditional areal ones. 

A more radical but committed American reconstructionist relying on 
phonological criteria, Clifford Leonard, insists on consistent use of only 
vocalic changes, seeing the most important split off as that between 
‘heterochronic’ versus ‘isochronic’ Romance languages. The former distin¬ 
guished, at an early stage, between free and blocked syllables in their 
treatment of stressed vowels, sometimes lengthening the vowel in the free 
syllable (see 4.2). A sub-grouping along these lines would class the dialects 
of north-east Italy and of Abruzzo and Apulia with French and 
Francoproven 9 al, but Venetian and Occitan with Spanish and Portuguese. 
This areal distribution would suggest that the heterochronic varieties are 
innovating, leaving a margin of more conservative areas. 

This grouping is consistent with a ‘wave’ model that, since Schuchardt 
(see page 4), has always been more widely favoured among European 
Romanists than the Stammbaum model. It accounts better for the appar¬ 
ent catenation of interintelligible varieties, with few distinct dialect bound¬ 
aries, across the Romance area. 

A widely favoured distinction between central innovating areas and the 
conservative rim is based largely on lexical criteria. For instance, 
Spanish/Portuguese and Rumanian seem to share eertain more archaic 
items that are lost elsewhere, like formosus rather than bellus ‘beautiful’, 
MAGis not PLUS ‘more’, or fervere not bullire ‘to boil’: 

(1) Span, hermoso, mas, hervir, Ptg. formoso, mais,ferver, Kum. frumos, mai,fierbe / 

French beau, plus, bouillir, Ital. bello,piu, bollire. 

Another distinction along similar lines, but using a range of criteria, 
most notably verbal morphology and lexical translation equivalents, 
would distinguish Romania continua, running from the Atlantic along the 
Mediterranean, from the two outposts, in northern Franee and Rumania 
(with Portuguese on the Atlantic rim also showing idiosyncratic features, 
and Castilian acting as a bit of a maverick). 

Other historical classifications are equally areally based. The most gen¬ 
erally accepted is the east-west divide, most consistently advocated by 
Walther von Wartburg, who drew a dividing line across Italy from La 
Spezia to Rimini. Wartburg’s criteria, curiously for a scholar best known 
for his lexicological work, were phonological - the west retains Latin final 
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-s and voices intervocalic voiceless consonants - but they were linked to 
social factors (see 6.2). The conservative retention of -s, important for 
morphology, was seen as due to the influence of schools, whereas the 
retention of intervocalic surds and of geminates in the east was a popular 
conservative feature. Sard conserves both features, having escaped the 
popular innovation of loss of final -s as a consequence of early isolation 
from the rest of the Romance area. 

Another less widely accepted areal grouping sees the Mediterranean as 
the centre of ‘Romanceness’, continuous intercommunication having pre¬ 
sumably been maintained by sea transport. 

(a) Classification on the basis of historical data 

In the post-Second World War period other attempts were made at quanti¬ 
tative classification with a historical perspective, using various criteria, but 
without specific reference to proto-stages. They arrive at different group¬ 
ings, or more often rankings. Mario Pei in 1949 measured seventy-seven 
‘change-points’ in Latin stressed vowels and came out with a ranking that 
placed French at one pole with 44 per cent of the points and Sard at the 
other with 8 per cent. Unusually, Portuguese by this measure was not 
grouped with Spanish. Joseph Grimes and Frederick Agard’s 1958 quanti¬ 
tative classification of phonological changes from Latin did place 
Portuguese nearest to Spanish, and French, showing most changes, at the 
opposite pole from Sard, but linked, through Catalan to the Iberian lan¬ 
guages. 

(b) Mixed historical/synchronic classification 

Zarko Muljacic’s classification of twelve languages mixes diachronic and 
synchronic data, using forty plus-or-minus parameters from different lev¬ 
els (of which sixteen concern phonological changes familiar from 
Stammbaum classifications). Many of these are chosen because they dif¬ 
ferentiate one language from the others, either as innovations or as 
archaisms - like the use of an iPSE-derived definite article (see 3.3) or of 
two forms of negative non in Sard, and the enclisis of the definite article, 
or the use of nominal case forms (see 3.3) in Rumanian. Nasal phonemes 
(see pages 24-9) differentiate French, Francoproven 9 al and Portuguese 
from the others, whereas the existence of central vowels characterizes 
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French, Occitan, Francoproven 9 al and Engadine Rhaeto-Romance. In 
Catalan, Spanish and Portuguese, there is a three-term set of demonstra¬ 
tives (corresponding to the three persons of the verb -‘here by me’, ‘there 
by you’ and ‘over there’), while a prepositional accusative (see 3.3) is used 
in Rumanian, Sard, Engadine Rhaeto-Romance, Spanish and Portuguese. 

A pairwise comparison of shared features, unsurprisingly, reveals that 
French and Francoprovengal, and Spanish and Portuguese, are very elose 
one to the other. More controversially, Catalan comes out closer to 
Occitan and to French than to Spanish, while Friulian is closer to 
Engadine Rhaeto-Romance than to Italian, which is quite far removed 
from most other languages, especially French. 

But how far is this grouping dependent on the choice of criteria rather 
than on real-life communicative factors? Giovanni Pellegrini, using virtu¬ 
ally the same measures, but including in his purview some Italian (and 
from his standpoint more significantly, Rhaeto-Romance) dialects, demon¬ 
strates that the category ‘Italian dialect’ makes little linguistic sense: there 
are more differences between standard Italian and a Lucanian dialect than 
between that dialect and Spanish or even Rumanian. The Rhaeto- 
Romance of the Dolomite Val di Fassa may be nearer, on these criteria, to 
Friulian than to Italian, yet, Pellegrini asserts, a Friulian finds it easier to 
read a newspaper even in Spanish than one in Dolomite Ladin. 

5.3 TYPOLOGY 

Typological classification of the Romance languages is less commonly 
practised than genetic grouping. The languages are, it is often argued, very 
similar typologically, and indeed fit into patterns that are familiar in 
European languages. Whether there is indeed a Eurotype, and what its dis¬ 
tinguishing characteristics are, is currently under investigation in a large- 
scale inter-European project under the auspices of the European Science 
Foundation. Certainly there are many formal resemblances between the 
Romance languages and the Germanic languages, even leaving aside the 
substantial resemblances that date back to their common Indo-European 
ancestors, or that have arisen since through communicative interaction. 

A 1963 study, by the Chilean Heles Contreras, applied to seven 
Romance languages the now classic morphosyntactic indices devised by 
Joseph Greenberg (measuring, for instance, the relative degree of synthesis 
versus analysis, of agglutination, derivation and compounding etc. in con- 
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structions). Portuguese was shown, by this count, to be the most, and 
French the least, synthetic of the modern languages, while in other 
respects Sard most resembles Latin. No clear classification emerged from 
the exercise, but it appeared to bunch together first French and Catalan, 
and then Spanish, Italian and Rumanian, with Portuguese and Sard as 
isolates, loosely associated to the first and second main groupings respec¬ 
tively. 

More recently the German Karl-Hermann Komer has proposed that 
the Romance languages be split typologically into two groups. One, 
including French and Italian, makes use of the partitive article and agree¬ 
ment of the past participle in the compound tenses with preposed objects 
(see 6.4), as well as the ‘predicative relative’ (or ‘tensed complement’, see 
7.4) and the ‘nominativus pendens’ (that is, topicalization in preference to 
left dislocation with clitic doubling; see 4.4). Only languages in this first 
group may be ‘non-pro-drop’ (though many of them are not). The other 
group, including Spanish, Portuguese, Sard, Rumanian and Engadine 
Rhaeto-Romance, uses a ‘prepositional accusative’ (see 3.3), left disloca¬ 
tion with clitic doubling, the infinitive as a noun and with an overt subject 
(see 4.3), as well as a dative as subject of the infinitive in causative con¬ 
structions (see 3.4). The data adduced are hotly disputed, especially for 
Italian, where Northern varieties seem to be privileged in the analysis. 
Korner, however, insists that he is looking for preferences rather than 
absolute distinctions, and that the division does not apply to the past, 
when the features he selects as diagnostic for typing the modern languages 
were admittedly more widespread over the Romance area. 

The Principles-and-Parameters version of Government and Binding 
(GB) theory has also led to the postulation of a whole host of potential 
typological distinctions within modern Romance - of which the most 
fruitful has been that correlated with the ‘pro-drop’ parameter (see 1.2). 
Other candidates for parametrization have concerned head government 
(manifested, for instance, in adjectival and determiner placement, see 3.4), 
verb raising (with evidence from, for instance, negative placement; see 7.4), 
the status of reflexive pronominals (see 4.7, 6.4), and hosts of others. In 
more recent ‘minimalist’ theory, parametrization has been more closely 
linked to morphology than to syntax; approaches along these lines have 
been made to features like the ‘Romance future’ (see 4.6) and placement of 
object clitics (see 6.5). 

Typological distinctions based on surface ordering of elements in the 
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sentence are now somewhat out of fashion in Romance studies, but did 
have some vogue among Romanists for a while. The putative passage of 
Latin SOV to modern SVO ordering was assumed to have passed through 
a stage where the Topic, rather than the Subject, took pride of place. 
Modern languages like French with consistent SVO ordering have also 
been distinguished from those with pragmatically determined freer word 
order. It has been proposed that Spanish, for instance, is basically a VSO 
language, and that French is heading in the same direction, with grammat- 
icalization of subject clitics as person markers (see 1.2). 

5.4 DIALECTOMETRY 

The greatest problem about such comparisons is that the data used are 
disparate: the findings of even the most meticulous dialect description are 
not directly comparable with the prescriptions of an official standardized 
language, backed by educational and administrative institutions. In parts 
of the south of Italy or in Sardinia, no dialect koinai have become estab¬ 
lished and nearby local varieties can differ considerably, with only the 
standard being used for intercommunication between localities. In north 
Italy, on the other hand, purely local varieties are ousted by regional 
koinai that tend to be sharply delimited one from the other. The diverse 
dialects are ‘Italian’ only in the sense that they are spoken in Italy: their 
linguistic relationship to the standard arises from interaction rather than 
any deep-seated and ancient affinity. The classic model of Italian dialec¬ 
tology, championed by Ascoli (see page 4), was that the standard, based 
on the Tuscan dialect, represented the continuation of a Latin relatively 
uncorrupted by contact with other ancient Italic dialects (allegedly 
because the substratum Etruscan language was so far removed from Latin 
that interference was minimal). It therefore was taken to represent most 
nearly the common denominator from which all dialects were historically 
derived. 

What was neglected in this classic account is the apparent closer linguis¬ 
tic affiliation of some ‘Italian’ dialects to non-Italian Romance. Indeed 
everywhere Romance non-standard varieties share features and tendencies 
not always reflected in the related standards. Support is thereby lent to the 
Schuchardtian contention that at the level of folk usage, before the imposi¬ 
tion of standards, there were, within the Romance area, few, if any, bound¬ 
aries (substantial bundles of isoglosses). Therefore classification other 
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than on a geographical basis was deemed impossible. However, modem 
‘dialectometry’ attempts to measure the linguistic distance between the 
local varieties, taking into account degrees of interaction. 

Dialect studies in the Romance area have a long and respectable history, 
dating back to the eighteenth century. Before that there were informal ref¬ 
erences to regional variants, as well as works explicitly composed in dialect, 
usually for comic effect or with political motives. The earlier more serious 
traditions favour the dialect monograph (often by a native speaker and 
usually concentrating on phonological changes from Latin, and highlight¬ 
ing differences between the local dialect and the related standard), and the 
questionnaire about non-standard usage, circulated to local dignitaries 
(the first being that of Abbe Gregoire, during the French Revolutionary 
period, which sought information for sociopolitical purposes). In the 
twentieth century priority has gone to the dialect atlas, which presents in 
spatial terms local variants of lexical items and common phrases. The first 
was the 1902-10 Atlas linguistique de la France i^ALF), under the direction 
of Jules Gillieron, a maverick linguist, who is best known for his opposi¬ 
tion to Neogrammarian sound-laws and his championship of the atom¬ 
istic ‘Each word has his own history’ slogan. The atlas initiated the use of 
amateur fieldworkers (specifically an ex-shopkeeper, Edmont) cycling 
between localities and registering the responses of chosen ‘good speakers’ 
(usually elderly) to a predetermined set of questions. 

The prestige of the ALF encouraged official support for a massive new 
series of more professionally executed French regional linguistic atlases in 
the postwar period. The ‘ecological’ aspect of this type of study is para¬ 
mount: fear that doomed rural dialects would not be recorded for posteri¬ 
ty triggered the attempt to preserve part of a national heritage that no 
longer served social or economic goals. 

The equally prestigious Italian linguistic atlas of 1928-40, Sprach- und 
Sachatlas Italiens und der Sudschweiz {AIS), edited by the Zurich 
University professors Jaberg and Jud, was conceived in the ‘Word and 
Thing’ tradition, seeking to link traditional culture with the language used 
to convey it. The fieldworkers included future luminaries of Romance lin¬ 
guistics like Max Leopold Wagner and Gerhard Rohlfs, and its data 
remain an invaluable record of interwar dialects in Italy, Sardinia and 
southern Switzerland. 

The tradition of linguistic atlases has been continued in the other 
Romance countries though, given the expense of carrying out and publish- 
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ing such projects, they have not always seen the light, except when they 
have received, as in France, substantial government subsidies. 

It is the dialect atlas that fostered the idea of measuring the distance 
between dialects - partly because isoglosses traced along the boundary 
between alternative forms rarely coincided with meaningful frontiers 
between sets of dialects. In France a bundle of phonological isoglosses do 
mark, in the west, the switch from French to Occitan, but the isoglosses 
fan out unpredictably in the east. The Western clearcut division may in any 
case be misleading, as there is evidence that it is there that Northern vari¬ 
eties have since the Middle Ages encroached heavily on Southern varieties, 
pushing perhaps some of the diagnostic isoglosses further south to form a 
more compact bundle. In the east, mainly in more mountainous terrain, 
incursion of Northern dialectal features may have been less consistently 
triumphant (though see 5.1 for ‘Francoprovengal’, as fundamentally a 
French rather than an Occitan variety). 

In Italy, too, a bundle of isoglosses appears to silhouette the central 
dialects, which were for centuries under papal rule. It is assumed that their 
separation from the prosperous north and the indigent south was accom¬ 
panied by communication barriers. It is noteworthy that regional dialect 
koinai, which are an important feature of the northern Italian linguistic 
scene (see 8.4), failed to develop in the Papal States. 

Typically, however, Romance dialects form a chain of mutually intelligi¬ 
ble varieties (leaving aside, of course, Rumanian). There are few insur¬ 
mountable natural barriers to communication within the Romance area: 
the Pyrenees and the Alps have accessible passes; rivers and seas can be 
crossed in boats. Traditionally villages would group round the market 
town and the diocese, but young people could still, with a little more effort, 
go courting in less familiar neighbourhoods. It has been claimed that in 
the Middle Ages forests posed more of an obstacle to intercommunication 
than other natural features: Old Walloon’s distinctiveness compared with 
other Northern French dialects has been attributed to a densely wooded 
dividing belt, rather than to the later political separation of the Walloon 
area in Belgium. In modern times the area, assumed previously to be 
thickly forested, forms a transitional dialect wedge between Picard and 
Walloon, with some features of both. 

In any case there were fewer settlements in remote areas in the Middle Ages. As 
population expanded and protection against aggression improved, more isolat¬ 
ed localities were settled, sometimes by people coming from far afield. 
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Dialectometrists are concerned mainly with the confines of areas of possi¬ 
ble intercommunication. Nowadays, of course, speakers rarely even 
attempt discourse in their local dialect with strangers from other localities, 
resorting immediately to a version of a standard, or at least of a regional 
koine. Besides, intelligibility depends on the goodwill and co-operative¬ 
ness, as well as the range of experience, of interlocutors, more than on lin¬ 
guistic factors. 

Jean Seguy, when in charge of the regional atlas of Gascony, noted that 
often there would be more differences between two close-by dialects than 
between those further off. Each locality diifferentiates itself from near 
neighbours (not only by language, but perhaps by wearing a hat in a differ¬ 
ent way and other such markers). This does not hinder communication, 
though, as speakers will be fully conscious of the dissimilarities, often a 
butt for mockery in the privacy of in-groups. Seguy maintained that it was 
not phonological variants (which merely produce unheeded surface noise) 
but lexical differences that make for loss of intelligibility. Positing that a 50 
per cent difference in vocabulary would render a foreign dialect impenetra¬ 
ble, he traced across Romance dialectal space the lexical representation of 
a hundred everyday concepts randomly chosen from atlas material, finding 
that there was a fairly constant relationship between lexical differences and 
geographical distance within the same umbrella ‘language’. On an imagi¬ 
nary walk across southern France, from Biarritz on the west coast to 
Menton in the east, there is no intelligibility frontier in the 700-odd kilo¬ 
metres covered, but going from south to north from the eastern Pyrenees 
to Liege in Belgium, a frontier has to be crossed at about half that dis¬ 
tance. On his journey north through Italy from Apulia to the Alps he 
encounters an intelligibility frontier after some 850 kilometres, but then 
things do not get much worse during the next 200 or so kilometres cov¬ 
ered. To illustrate his point more sharply he pinpoints the sudden increase 
of lexical differences as he passes from his home-base in Gascony, east to 
Catalan (Roussillon) and west to Basque territory. 

Seguy and his successors, most notably Henri Guiter and Hans Goebl, 
also use more sophisticated quantitative techniques and a wider range of 
criteria, though still culled from atlas data, in order to trace the bound¬ 
aries between dialect groupings. Sub-dialects, dialects and languages are 
defined in terms of the concentration of differences or similarities of lin¬ 
guistic features over geographical space. Transitional areas and hybrid lan¬ 
guages nevertheless still exist, and there are gaps in frontiers at points 
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where intercommunication between the different groups is more frequent. 

Nevertheless a perception of what constitutes a dialect grouping may be 
related to a real or imaginary prototype. For instance, the most prototypi- 
cally Gascon of the dialects or south-west France is Bearnese, hard up 
against the Pyrenees. From there in every direction dialects display fewer 
emblematic Gascon features as they retreat from the Pyrenees, with partic¬ 
ularly intense de-Gasconization around large centres of population. 

5.5 STANDARDOLOGY 

If dialects merge into each other on the ground, each modern standard is, 
on the other hand, almost consciously defined in contradistinction to the 
others and usually covers a defined territory. Within each area local usage 
nowadays adjusts to a greater or lesser degree to the standard (reacting 
sometimes by setting up its own rival standard). 

A type of classification known as ‘standardology’ distinguishes the 
Abstand (‘stand-off’) from the Ausbau (‘built-up’) language, each embrac¬ 
ing a range of related varieties, social or regional. The former may on 
linguistic grounds (but which?) be distinguishable from its neighbours, but 
it is conscious elaboration that establishes the clearly delimited standard. 
The Abstand can be described in terms of a distinct diasystem. The 
dialects it covers usually shelter under the ‘roof’ (a better metaphor might 
be ‘are within the orbit’) of an elaborated form of one of the dialects, or of 
the fruit of accommodation between the dialects (a koine). In some cases 
the ‘roof’ may be provided by a totally distinct language: the Germanic 
dialect of Alsace operates, for political and cultural reasons, within the 
orbit of standard French and not of standard German. 

(a) Abstand 

The diasystem can manifest itself spontaneously as a koine, used for inter¬ 
communication between speakers of related dialects who have some sense 
of solidarity. Such koinai grew up, as we have seen, in northern Italy, and 
perhaps in medieval northern France. In southern France a degree of 
koineization between local dialects was effected after the Revolution. 
Compulsory military service, which grouped young conscripts by region, 
helped to create an underground barracks Occitan, along with a comrade¬ 
ship that distanced them from the French-speaking officer class. 
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The more radical levelling (or perhaps non-differentiation) of (Daco-) 
Rumanian dialects may be different in character. It has been suggested (by, 
among others, Hungarians who still smart from the secession of 
Transylvania in 1918) that the lack of dialect diversity in the Rumanian 
state is evidence that the language was imported at a comparatively late 
date from south of the Danube (where isolates are much more diverse - 
see 5.6) by Vlach migrant herdsmen. Other possible explanations are that 
feudal organization and its consequent fragmentation of early medieval 
social grouping never hit that corner of Europe, or that the Ottoman strat¬ 
egy of Balkanization (dividing the Empire into small satellite states the 
better to exercise control) had the effect of ensuring for Wallachia and 
Moldavia more internal linguistic unity. 

(b) State standards 

The codification of the four Western Romance standard (Ausbau) lan¬ 
guages can be dated back to the Renaissance, though vernacular literary 
and administrative texts are numerous from the twelfth and thirteenth cen¬ 
turies. The desire for a unified state language is associated with the threat¬ 
ened demise, in the sixteenth century, of Latin as the vehicle of culture, law 
and government. At the same time there were being fashioned in Europe 
large states with more absolute power than had been possible in the feudal 
era. The adoption of an authoritative state language could, it was thought, 
bolster external prestige, manifested by cultural brilliance and military 
conquest, as well as internal control, enabling the raising of the resources 
to finance the prestige. There was no need, though, for it to be imposed as 
a spoken idiom on those whose only role in government was to supply tax 
revenue and cannon fodder: indeed there was advantage in restricting 
access to the language of power. Why, therefore, frown upon a codified 
elitist or regionally restricted language that was far removed from the 
speech of the masses? 

(i) Spain 

In almost the first of the European unified states, Moorish occupation 
confined the early medieval Christians to the far north, with their 
Galician, Asturo-Leonese and Aragonese dialects strung out from west to 
east. In the south, a koineized Spanish Mozarabic was allegedly used by 
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Moors, and especially by Jews and the small part of the population 
remaining Christian (most of whom eventually went north to resettle con¬ 
quered areas). 

Castilian was probably originally an innovating Leonese variety, min¬ 
gled perhaps with Mozarabic and Basque, developed by the settlers of the 
first reconquered territories (the land of the forts, castillas). It eventually 
took over most of the peninsula, as it swept down, widening its territory as 
it proceeded south. A series of alliances, as well as military successes, 
meant it engulfed first (from the thirteenth century) the Leonese and then 
(at the end of the fifteenth century) the Aragonese territories, and flooded 
the south, with the capture of Granada, the last Moorish bastion, in that 
famous year 1492, when the quest for the New World was launched, and 
the Jews were expelled. The Reconquest resembled what is now called ‘eth¬ 
nic cleansing’, with expulsion and extermination of undesirable elements, 
and resettlement of depopulated zones by more congenial immigrants, 
making for more homogeneous language use the further south it penetrat¬ 
ed. The imposition of Castilian as the official language of the state dates, 
however, only from the early eighteenth century, on the advent of the 
Bourbons to the Spanish throne. 

The result has been that, except in the north, where original dialects per¬ 
sisted, Castilian became not only the administrative, but also the spoken, 
language of Spain (though in the west, Portuguese first asserted its politi¬ 
cal independence in the twelfth century, and in the east, Catalan was the 
chancery language of Aragon until the union with Castile). Royal patron¬ 
age, obsessed with glory and its holy vocation, aimed to render the lan¬ 
guage of Castile, as perfected in Toledo, worthy of the kingdom and its 
religious mission, and fit to be spread through the country, and eventually 
round the world, with the empire-building forces: the first grammar was 
ceremoniously presented to Queen Isabel in that illustrious year of 1492. 
The Spanish grammatical tradition has remained until recently one of 
intolerant purism. Yet the Castilian dialect is deviant phonologically and 
innovating morphologically, and far from a koineized compromise 
between Iberian varieties. It nevertheless constitutes the official language 
of the whole of the Spanish state, which all Spanish citizens have the duty 
to learn. 

Today racier Southern versions of the language, which resemble in some 
phonological respects the Latin-American varieties, have been acquiring 
more prestige than what is often perceived as the rather stiff and harsh 
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idiom of Castile itself. In Spanish America the nineteenth-century 
Venezuelan statesman and grammarian Americo Bello sought to codify 
the language for the new states freeing themselves from the colonial yoke, 
but by perfecting and rationalizing tradition, not by breaking with it. Thus 
differences of the American standards from European Spanish are more 
phonological and lexical than grammatical (see 8.7). 

In twentieth-century Spain, after a brief period in which Catalonia 
enjoyed some autonomy, the Franco era, with its ‘Espafia una’ policy, 
marked suppression of other languages in the peninsula. In 1978, however, 
new legislation established semi-autonomous regions throughout Spain 
and encouraged the protection within the regions of local ‘modalities’, as 
part of cultural patrimony, while still insisting on the role of Castilian as 
the language of the whole state. Standard forms of some of the regional 
idioms have since been accepted (see 8.3). 

(ii) Portugal 

The European standard is based on the variety characteristic of the 
capital, Lisbon, where phonological development was idiosyncratic. 
Even though the Brazilian variety, phonologically more conservative in 
some ways, differs considerably, European Portuguese speakers have no 
difficulty in understanding it (indeed Brazilian television soap operas 
are very popular in Portugal). But it should also be noted that the 
Portuguese, who regard themselves as adaptable, quick-witted and easy¬ 
going, readily comprehend both spoken and written discourse in Spanish 
(though the understanding is not reciprocated). By most objective stan¬ 
dards, it is almost only the phonological (and orthographical) differences 
that set the two languages off from each other (though the Portuguese 
point proudly to their distinctive future subjunctive and personal infini¬ 
tive - see 4.3). 

In the Middle Ages, the two varieties were conventionally used for dif¬ 
ferent literary genres, the ‘softer’ Western pronunciations being seen as 
more suitable for lyrics and the ‘virile’ Castilian for epic. In the sixteenth 
century even writers of the standing of Luis de Camoes or Gil Vicente 
alternated between the languages, and by this time Portuguese was regard¬ 
ed as less prestigious. Hostility between the two states is deep-rooted, and 
it was really only in the seventeenth century, after a period (1580-1643) 
under Spanish rule, that Portuguese shook itself free of Spanish and creat- 
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ed a standard almost deliberately in counterpoint to its neighbour. (For 
fronterizo see 8.5.) 

As a poor and backward maritime country which looked out more to 
the wider world and to emigration than to its internal development and 
education, Portugal did little until recently to encourage adoption of the 
standard by all its population, so that the diversity of dialects still persists, 
and literacy spread very slowly. One difficulty posed to codifiers by the 
phonological deviance of the standard was that of orthography. Reform 
followed reform, in some cases negotiated with Brazil. 

The devising of a standard ‘educated Brazilian’ has recently prompted 
extensive investigations into the forms actually used by the relevant speak¬ 
ers. Subservience to the older established standard is no longer well 
regarded among the elite, who nevertheless are antipathetic to the more 
‘creolized’ Brazilian varieties. 

(iii) France 

French is also usually seen as directly derived from the dialect of the capi¬ 
tal city, or at any rate of the He de France region in which Paris stands. Its 
adoption as the state language was a purely political move dating from the 
sixteenth and seventeenth centuries. In the early Middle Ages other 
‘dialects’, especially those of the west (Normandy), the north (Picardy) 
and the east (Champagne), were more regularly used for literary expression. 
However, it is sometimes claimed that the so-called dialectal differences 
manifested in the medieval texts are more a matter of the orthographical 
tradition followed by the scriptorium in which the manuscript was written 
than a reflection of local spoken usage. 

From the thirteenth century the vernacular replaced Latin in charters, 
and the chancery language used in different northern regions can be relat¬ 
ed to later attested spoken dialects. However, dialect traits usually are in 
the minority in such charters, and it is possible that in the whole of the 
north there existed a koine with attenuated dialect features. It was this koine 
that came to be identified with the metropolis to which people flocked from 
all regions, rather than with the more distinctive Central dialect that evi¬ 
dence suggests was used by the longer-standing Parisian populace. It was 
this koine too that ousted dialect in literature by the fourteenth century. 

Strengthening of state functions required the use of a common lan¬ 
guage and it was, under Francois I, a 1539 edict (of Villers-Cotterets) that 
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is usually cited as marking the triumph of French as we know it. However, 
the wording used gave rise, even at the time, to legal wranglings: langaige 
maternel francois ‘French mother tongue’ was taken by some to imply ‘the 
mother-tongue that I, as a subject of the French king, speak’, rather than 
‘the vernaeular of the He de France’. Whether the edict was directed main¬ 
ly against Latin or against dialect is disputed. What is certain is that the 
spoken usage of the illiterate masses would be largely unaffected by such a 
move. 

In the sixteenth century, intellectuals, backed by political power, exerted 
pressure for a state language to aspire to the prestige and the international 
acceptance of Latin. Debate raged about what vernacular form it should 
be based on, and especially about codification of its orthography and 
grammatical rules. No doubt one motivation for seeking to replace Latin 
was the advent of printing, which potentially would spread literacy to con¬ 
sumers who had not the leisure nor the patience for classical studies. 
Another was the Reformation, which sought to spread the word in a ver¬ 
nacular accessible to all - just as Luther is sometimes said to have virtually 
invented German, so Calvin was an important figure in the creation of 
French. 

Sixteenth-century polemicists on the whole preferred a ‘polynomic’ 
standard that would not sacrifice to bland uniformity the picturesque con¬ 
tributions of dialects, but would tolerate regional variation. Parisian culti¬ 
vated middle-class usage was, moreover, favoured over that of the 
Italianized court. The seventeenth century saw a switch in emphasis, espe¬ 
cially after Louis XIV sought to curtail the influence and provincial affilia¬ 
tions of the turbulent nobility, by gathering them around his court, and by 
fostering centrally controlled administration. The language imposed on 
the country, and the vehicle for a new literary blossoming, was that of the 
‘most sensible part of the Court’. The middle classes were quick to ape 
fashionable courtiers, in language as in other respects, but there is little 
indication that the adoption of the standard affected the mass of the pop¬ 
ulation, whose duties of devotion and dumb obedience required minimal 
verbal skills. Commentaries on dialect usage from this period took the 
form of comic parodies or admonitions to misguided speakers. 

By the time of the Revolution, though, it is quite likely that in much of 
northern France (especially in the towns where literacy rates were high) 
lower-class speakers did not think of their usage as different regionally, as 
distinct from socially, from that of their betters. In the south, and in mar- 



212 


The differences 


ginal and rural areas, however, dialects persisted as strongly as before. The 
Jacobin doctrine, that a common language was necessary to permit all citi¬ 
zens of the ‘nation’ (in the sense, here, of all whose loyalty is willingly 
given to the state) to take a full part in democratic processes, has been eon- 
demned as destructive of the local dialects. It was, however, more the nine¬ 
teenth-century expansion of education, conducted exclusively in French, 
especially after the introduction of compulsory primary education in 
1882, and the increase in mobility, both social and geographical, that effec¬ 
tively led to the virtual death of most North French dialects. It is only after 
the Second World War that local languages spoken within France gained a 
degree of recognition. 


(c) Nation-state standards 

The nineteenth-century ‘nation-state’ era of nationalism was initiated by 
the French Revolution. The ideology decreed that the state should be legit¬ 
imized not only territorially (with ‘natural frontiers’ and a ‘threshold size’) 
but also by the ethnicity and language loyalty of its citizens. 

(i) Italy 

Italian, although dating, as a standard, from the same sort of period as 
Spanish, French and Portuguese, was originally a written culture language 
with little governmental backing. It was fashioned from a sanitized literary 
version of the Tuscan dialect, gaining prestige from the earlier writings of 
the Tre Corone - the ‘three crowns’ of Florence, Dante, Petrarch and 
Boccaccio - whose usage was taken as a model by sixteenth-century 
arbiters (notably Cardinal Bembo). It was Rome, at the centre of the Papal 
States, that promoted the literary language as the common language of the 
peninsula, but local dialects remained dominant, as they still continued to 
do after the Risorgimento and Unification. This latter was virtually com¬ 
pleted by 1860, at the height of nationalist fervour. 

Although the new King of Italy, and his court, were, as northerners, 
more at home in French, Italian was adopted as the ‘national language’ to 
mould the very diverse regions of Italy into one cohesive polity. The doc¬ 
trine adopted was that dialects of Italy must be ‘Italian dialects’, with 
close historical and cultural affinity. ‘We have made Italy’, said one states¬ 
man, ‘now we must make Italians’. Objective criteria for the affiliation of 
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dialects were ignored: for instance, the justification offered for surrender¬ 
ing Nice to the French in 1860 was, surprisingly, that Nigois was remote 
from Italian. Linguistically, however, it is closer to the neighbouring 
dialects of Liguria than to those of Provence. 

Today about 60 per cent of Italians (compared with 2.5 per cent in 1860) 
do claim the standard language as their first tongue, but regional variation 
is great. Monolingual dialect speakers are rare nowadays, but even when 
they still existed the majority could with an effort usually understand 
Italian (but, note, so can Spanish speakers). Many Italians, especially in 
the north, with a tradition of dialect literature, control at least three relat¬ 
ed languages: a local dialect, a regional koine and a regional standard (see 
8.4). In some urban centres, like Florence and Rome, some speakers may 
think of their dialect as ‘bad Italian’, rather than as a separate idiom, but 
more often loyalty and affection are bestowed on the dialect rather than 
the standard. However, interdialect communication is rare: television 
viewers claim that popular theatrical representations in dialect from 
regions other than their own are incomprehensible to them, and southern 
immigrant workers in the northern industrial centres often feel excluded 
by their ignorance of the Northern local dialects. 

Recent political developments have seen support for separatism among 
northerners, but the 1946 regional legislation did much to defuse feeble 
cries for separate standard languages, and dialects, however diverse lin¬ 
guistically, remain within the orbit (or under the roof) of a standard that is 
tolerant of variation. 

(ii) Rumania 

Codification of the Rumanian language was carried out, largely with 
French encouragement, with the aim of differentiating it maximally from 
its non-Romance neighbours. The basis of the standard was the Daco- 
Rumanian educated usage of the capital Bucharest, but, as we have seen, 
there was comparatively little dialectal diversity among Rumanian speak¬ 
ers within the state. Latin orthography was imposed and many Western 
loanwords were introduced to replace, or supplement, older Slavonic 
words. Although Rumania has always had substantial minorities of other 
language groups (including Gipsies - see 8.6), the establishment of the 
Rumanian state in 1862 (and its enlargement by the incorporation of ex- 
Hungarian Transylvania after the First World War) was conceived in the 
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‘nation-state’ spirit of the time, which modified the ‘nationality’ require¬ 
ments by those of economic viability. A state which filled the ‘threshold 
size’ requirements of nationalist ideology would inevitably in the Balkans 
include subordinate national groups. 

Irrespective of the size of the state, the between-war period brought 
little prosperity and much conflict to Rumania; emigration of intellectuals 
and artists was particularly intense. Postwar, when language questions 
were at first enthusiastically pursued, early hopes of a well-ordered, pros¬ 
perous and tolerant future were soon to be dashed, and today Rumania 
remains in a sorry state, with, once again, many of its intellectuals seeking 
refuge abroad, and ethnic and social conflict barely held in check. What 
does seem to have changed is that now the other Romance languages hold 
less attraction for Rumanians than, say, English or German. But there is at 
present little in the way of coherent discussion of the role of the standard 
in the new Rumania. 


(d) Small-nation standards 

Newer Romance standards have been developed in the mode of the ‘small- 
nation’ doctrines of Woodrow Wilson, but with the added incentive of the 
promise of European integration and federation and the dissolution of the 
large European nation-state. Some groups of dialects have sought to free 
themselves from the orbit of the state language to which they have unwill¬ 
ingly been subordinated (on language conflict see 8.3). Standardization is 
seen by activists as a means of breaking with the parish-pump associations 
of dialect and fitting their mother tongue for the modern world, by 
increasing its scope both in catchment area and in subject matter. (For fear 
of language death see 8.3). Earlier associations of the assertion of national 
language rights with irredentism and separatism evaporate, as new stan¬ 
dards look for validation to Brussels, rather than to Paris, Rome or 
Madrid. 

(i) Spain 

Since 1978 there has been official encouragement for standardization of 
local languages within the Spanish state. Standard Catalan, a success 
model for other small nations, is based on Barcelona usage, but taking into 
account other varieties. Nineteenth-century codification was purist and 
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archaizing, but in the twentieth century more liberal doctrines developed 
by Catalan activists have strongly influenced other national language 
movements. Today the processes of normalization or ‘status planning’ (fit¬ 
ting the language for a wider range of social roles) and normativization or 
‘corpus planning’ (codifying the language) are handled sensitively and pro¬ 
fessionally, with moral and financial support from the regional govern¬ 
ment. 

Standard Galician, even though it is spoken by a larger proportion of 
the local population, has had a rather harder time than Catalan, partly 
because of disagreements about the shape standardization should take. 
Unlike industrialized, go-ahead Catalonia, Galicia has long been a back¬ 
ward, impoverished part of the country, reliant on agriculture and fishing 
(though from very early on a centre of tourism, with the popular pilgrim¬ 
age centre of Santiago de Compostela). The local language was sup¬ 
pressed by the political and religious authorities and spurned by the local 
middle classes, so that the nineteenth-century revival of national lan¬ 
guages was not so successful in this region as elsewhere. However, the very 
poverty and neglect of Galicia helped to preserve the traditional dialects 
and also created some sense of national identity, in that the masses of 
enterprising Galicians who emigrated to the New World to seek their for¬ 
tune, nostalgic for their verdant homeland, kept the local language alive 
and gave moral and financial support for its rescue. Language activists, 
however, were split between those who advocated standardization along 
the lines of Portuguese, and those who preferred to go it alone. Post- 
Franco liberalization of language policy has allowed the Galician regional 
Xunta to encourage use and diffusion of the language, but opinion is still 
split on how far the sway of Castilian, as the language of the Spanish state, 
should be challenged. 

For other Spanish languages, Valencian, Asturian and Aragonese, 
where support for standardization is less strong, and the language debate 
is often animated, see 8.4. 

(ii) France 

In France, the medieval literary language of the south was a skilful blend¬ 
ing of the Poitou dialect with other varieties. How close it was to spoken 
language in the area is hard to tell, but as an artistic language it was fairly 
unified. After the violent crushing of southern cultural individuality by 



216 


The differences 


the north in the thirteenth century, the language continued to be used for a 
while for literary purposes in the south and in Catalonia, and, indeed, it 
was codified at this time. But by the seventeenth century it had been 
degraded to a mere set of low-status patois, roundly condemned by lan¬ 
guage arbiters. Although the dialects were still in use as the first language 
of most Occitans in the nineteenth century, French language and educa¬ 
tion policy has since succeeded in reducing their scope and status. They 
declined rapidly till the middle of this century and were confined mainly to 
older users in rural communities. The literary revival led, from 1854, by the 
Felibrige did seek standardization, but only locally based (specifically the 
Provence varieties), and attempts to devise a standard language that would 
serve the needs of speakers across the region were left till the mid-twentieth 
century to militants inspired mainly by principles enunciated by the 
Catalan standardizers. Standard Occitan has had a degree of success since 
1951, when it was admitted as a subject for school examinations: the result 
has been a noticeable slowing down of the decline in its use among young 
people, and an increase in use in urban environments. However, there is no 
success commensurate with that of Catalan standardization, and a great 
deal of disagreement about how many Occitan speakers remain (estimates 
ranging from about 1 million to an unlikely 10 million). 

The dialects spoken in Corsica are usually classed as Tuscan, but influ¬ 
ence from French, and decreasing contact with Italian, since the island’s 
incorporation into the French state in 1769, has given them a more sharply 
silhouetted profile. Attempts to standardize the language have aimed at 
the ‘polynomic’ approach favoured for Occitan, which does not force 
speakers of different dialects to abandon their own usage, yet allows them 
to participate in a wider linguistic community. It remains to be seen 
whether a standardized Corsican will catch on, now that the island has a 
significant proportion of non-Corsican residents, especially since the set¬ 
tlement there of refugees from Algeria after independence. The island, 
moreover, is a holiday centre for both French and Italian speakers. 
Unjustifiably linked by opponents with a more violent separatist move¬ 
ment, the Corsican language cause attracts little official support. 

(iii) Switzerland 

Although moves have been made from 1880 to devise a standard to 
accommodate the diverse Swiss Rhaeto-Romance dialects, a common Ian- 
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guage did not get going until 1982. Inevitably, as the largest number of 
speakers use Surselvan, it is based largely on that set of dialects, but great 
(some say too great) efforts have been made to take in Engadine features. 
The codified koine Rumantsch Grischun is welcomed especially by those 40 
per cent of speakers who now work outside the Orisons canton, who fear 
that their children will have no proper access to the language, but within 
the canton there is comparatively little support for it to be used in schools. 
Given the quite wide differences between the local varieties, for instance in 
ways of negating the verb, it would be preferable to allow some leeway 
within the standard for variants, rather than imposing a single norm. It 
has yet to be seen whether the creation of a standard will halt the erosion 
of use of the dialects. 


5.6 LANGUAGE AND DIALECT 

Grouping of European Romance ‘dialects’ is usually done on the basis of 
genetic classification and of the ‘roofing’ language. They are nearly always 
contiguous territorially and usually derive their names from the area in 
which they originate (for discussion of the social status of dialects see 
chapter 8). Only in the Balkans, in southern Italy and in Sardinia are there 
pockets of Romance speakers of varieties that differ radically from those 
that surround them. 

In the south of Italy and Sicily there survive isolated Northern Italian 
(Gallo-Italian) dialects in immigrant settlements, attracted there by royal 
patronage, mainly during the twelfth and thirteenth centuries, partly to 
repopulate areas after expulsion of Saracen dissidents. They have not 
accommodated to their neighbours’ Romance dialects sufficiently to have 
lost all their distinctive dialect features. In northern Sardinia, influxes of 
Corsican refugees to what were presumably unpopulated areas also consti¬ 
tuted dialect groupings distinct from the indigenous Sard dialects. 

The Balkans harbour some Romance varieties surrounded and inter¬ 
mingled with non-Romance. These include a sprinkling of Arumanian 
dialects in Albania and Macedonia. Although the earliest inscriptions date 
from the eighteenth century, they are thought to date back to Roman times 
and to be more archaic in character than the Daco-Rumanian varieties of 
the Rumanian state. A smaller pocket of different Rumanian dialects - 
called Megleno-Rumanian - precariously survives in the north-east of 
Macedonia. Some of the Macedonian Rumanian speakers settled in 
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Rumania after the Second World War, escaping from the conflicts between 
Slavs and Greeks. The tiny groups of Rumanian speakers in the Croatian 
Istrian peninsula, on the other hand, feel no affinity with the Rumanian 
state, even though their dialects are closely related to Daco-Rumanian. 
(For Friulian speakers in Rumania see 5.1.) Some Judeo-Spanish dialects 
(see 8.7) also survive in the Balkans and in Turkey, carried there after the 
1492 expulsion from Spain. 

(a) Dialect and regional standard 

A distinction is usually made, by linguists though not necessarily by speak¬ 
ers, between the dialect of a locality and the regional version of the stan¬ 
dard within whose orbit the dialect is drawn. A dialect is assumed to have 
descended directly from Latin, whereas the regional variety is seen as a 
modification of the imported ‘roofing’ language. This is one of the reasons 
why the dialect usage of monolingual elderly rural inhabitants is regarded 
as more ‘authentic’ or ‘purer’, in that it is more archaic and less likely to 
have been contaminated by contact. However, as we have seen, the degree 
of mutual intelligibility between different Romance varieties has probably 
meant that they have always interacted. On the other hand, use of some 
local dialect features may be regarded as a community membership badge 
(see 8.3). The consequences of accommodation between different 
Romance varieties are discussed in chapter 4. 

(b) Overseas dialects 

When a Romance-speaking community has established itself in a far-off 
land, its usage over time will differ from that of the metropolitan variety. 
The degree to which this happens is related to the amount of contact with 
the metropolis, and to the social attitudes of the speakers. We shall be 
examining American Spanish, Portuguese and French in 8.7, and consid¬ 
ering how closely they are related to metropolitan dialects or standards. 
Their linguistic distance from the home languages is sometimes described 
in terms of creolization by contact with other languages. Here we shall 
consider in what ways we can distinguish between a non-European 
‘dialect’ and a true ‘creole’ which, although still a member of the Romance 
family, no longer conforms to the Romance type, and has resigned from 
the Romance club (see page 95). 
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5.7 CREOLE AND DIALECT 

The difference is not always clearcut, especially in those regions in which 
varieties that span the two are used. I shall not here go into the general prob¬ 
lem of coverage of the term ‘creole’, as it is one that originated as a 
Romance word and, within Romance, refers only to those language varieties 
that arose between the fifteenth and eighteenth centuries following contact 
of European Romance-speaking adventurers, traders (especially slave- 
traders) and colonists with non-European populations. These varieties have 
some common features, but also differ widely among themselves. 

Not all Romance creoles are so called by their speakers: French 
Caribbean varieties are often called patois', other varieties receive a name 
that means, sometimes pejoratively, ‘language’ (papia(mentu), lengua, chaba- 
cano, etc.) or that refers to the name of the region. ‘Creole’ in French was a 
name originally given to French colonists, so that in Louisiana, for instance, 
it referred to a variety close to standard French, whereas the creole was 
known as ‘Negro-French’, or, pejoratively, ‘gumbo’, the Bambara word for 
‘okra’. Just as the designation differs, so does the association between differ¬ 
ent Romance languages and their related colonial offshoots: some of the 
overseas varieties may be more profitably regarded as ‘dialects’ than as cre¬ 
oles, retaining many of the grammatical features of their lexifier languages. 


(a) Spanish 

It has been argued that this may be true of Spanish-based varieties. Little 
reliable data exist for Philippine Spanish, which has been seen in any case as 
a relexified Portuguese creole. Papiamento, spoken in Curagao and neigh¬ 
bouring islands, is also sometimes thought to be originally Portuguese, and 
is regarded as an atypical creole (see 1.3). Palenquero, spoken in an origi¬ 
nally maroon (runaway slave) settlement in Colombia, has only recently 
been classified as creole, and even then is seen as heavily ‘decreolized’, as its 
3,000-odd speakers all speak Spanish (kateyano). It is better then not to use 
these Spanish varieties as examples in our quest for ‘creolity’. 

(b) Portuguese 


Portuguese-based creoles are a different matter, as there is a long documented 
tradition of lingua do preto, ‘black’s language’, of the pidgin that was used by 



220 


The differences 


mariners and traders throughout the world. However, the creoles found in 
Africa differ from Asian varieties (which in any case are now obsolete or 
obsolescent), especially in that they are still in close contact with Portuguese, 
and also, in some cases, with African languages. In Cape Verde, crioulo is the 
national language of a mainly mixed-race population; variants range from 
something very like colloquial Portuguese to a version that is more like that 
of Guinea-Bissau, where crioulo is widely used by Africans as a second lan¬ 
guage. Gulf of Guinea (Sao Tome, Principe, Annobon) creoles grew in social 
conditions that were more similar to those of New World creoles - slavery in 
a plantation agricultural system - with more infiltration of African features 
and lexicon, distancing the languages more decisively from Portuguese. 

Asian Portuguese speakers - usually Christian and of mixed race - iden¬ 
tify, on the other hand, more closely with Portuguese: intermarriage and 
assimilation were encouraged by the metropolitan authorities. In so far as 
their languages survive, they vary widely; Macao Portuguese could be 
equated with popular metropolitan and Brazilian varieties, while that of 
Batticoloa in Sri Lanka is now penetrated by Tamil grammatical features. 
Popular Brazilian has itself been called a ‘semi-creole’, in that inflections 
are much reduced compared with the standard, and periphrasis is 
favoured. However, many non-standard metropolitan Romance varieties 
show similar tendencies. Indeed the term frangais avance for popular vari¬ 
eties of French (see 8.2) implies the view that Romance’s future lies in 
simplification of morphology and greater reliance on syntactic devices. 

(c) French 

It is the French-based creoles that most closely approximate to the more 
traditional idea of what a creole is. They are all the product of importation 
of African slaves for plantation (tobacco, cotton, sugar) agriculture, dur¬ 
ing the seventeenth and eighteenth centuries, and they all have similar 
grammatical features, even though they are geographically as far apart as 
the Caribbean and the Indian Ocean. It is easy to imagine that they could 
have been created from an amalgam of French lexicon and African gram¬ 
mar; there are difficulties, however, in tracing the precise African origins. 

(i) Caribbean 

The French creole which deviates most radically from the Romance pat¬ 
tern is that of Haiti, spoken by some 6 million people, most of them 
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monolingual, in a country that won its independence, after slave revolts, in 
1804. Many of the other French creoles have been used continually along¬ 
side French since their creation. Indeed, until recently they were often not 
perceived as other than a variety of French: comments on the ‘otherness’ 
of creole mainly date from the eighteenth century. Today, in the French 
Caribbean Departements d’Outremer (DOMs - Guadeloupe and 
Martinique), creole is widely used among all sections of the population, 
with funetional differentiation and frequent code-switching between creole 
and French. Regional popular French varieties are tending to disappear as 
standard French and creole polarize, so that the ‘creole continuum’ that 
has been described for New World English-based creoles is not so evident. 
In islands where English is the official language - like St Lucia and 
Dominica - the creole seems to be less influenced by French, and is threat¬ 
ened with extinction (as in Grenada and Trinidad). On the coast of 
Dominica facing Guadeloupe the creole survives better, presumably 
because contaet is maintained with Guadeloupian creole. 

There are, however, some cases in which French ‘dialects’ and ‘creoles’ 
are said to exist side by side and are not clearly demarcated: in the 
Caribbean, the Indian Ocean and in North America. 

(ii) St Barth and St Thomas 

A curious linguistic situation is reported for the tiny, dry, rocky Caribbean 
island of St Barth(olomew) in the DOM of Guadeloupe, which has an 
almost wholly white population, reputedly descended from Norman immi¬ 
grants in the seventeenth century. In the west of the island French varieties 
that some characterize as ‘Norman’ are used, while in the east there is a 
creole that is readily interintelligible with that of Guadeloupe and 
Martinique. Each part of the island was said to practise endogamy, at least 
until the 1970s. Few preeise linguistic or sociolinguistic details are yet 
available, although investigations are in progress: the fact that the island is 
now an exorbitantly expensive winter resort for the international smart set 
cannot help research. Certainly the ‘dialect’ is much closer to French than 
the ‘creole’ (however, as all education is in French there is likely to be cont¬ 
amination). Examples from the verb morphology of each variety illustrate 
the French of the dialect compared with the creole: 

(2) Dialect [jekimas] / [jetekimase] / [imasra]; creole [ikama 3 e] / [itekamase] / 

[ikemdse] ‘he is eating / he was eating / he will eat’ (= French il mange [il est qui 
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mange] / ;7 mangeait [il etait qiii mangeait] / il mangera). Dialect (like French) Je 
te le donne; creole Mwe ka ba ou li ‘I give it to you’. 

More is known about the ‘dialect’ that was reportedly imported in the 
nineteenth century from St Barth by fishermen immigrants to Carenage in 
St Thomas, now part of the US Virgin Islands (and, incidentally, the birth¬ 
place of the impressionist painter Pissarro). There is little that differentiates 
their French from other non-standard American varieties. A farming com¬ 
munity of St Barth origin at Northside in the same island is said, however, 
to use creole, but it has not been studied. In both communities English is 
reportedly currently in use, but the respective French varieties are said to 
be mutually intelligible. 

It is possible that the St Barth / St Thomas creole was imported from 
Guadeloupe by refugees during the Revolutionary period, but it has also 
been suggested that it was an indigenous creation, at a period (late seven¬ 
teenth / early eighteenth century) when apparently slaves formed about 40 
per cent of the population of St Barth. 

(iii) Indian Ocean: Reunion 

This volcanic island (originally called Bourbon), a French DOM and part 
of the Mascareignes group in the Indian Ocean, uses what is locally called 
‘creole’. This is spoken, with a wide degree of variation, by a poor white 
population mainly in the mountainous interior, descendants of eigh¬ 
teenth-century immigrants, by communities largely descended from 
maroon slaves in inaccessible localities in ex-volcanic craters, as well as by 
the racially even more mixed population on the coast. Most commentators 
agree that these varieties have evolved from an archaic popular French; 
some would deny they are creoles, rather than dialects. Their relationship 
to the undisputed creoles of Mauritius (earlier called He de France) and 
the Seychelles is hotly disputed. The creoles of these last two island clus¬ 
ters are closely related, and are more decidedly creole in character. 
Probably the Seychelles varieties were originally, in the nineteenth century, 
imported from Mauritius, when, on to the previously unpopulated archi¬ 
pelago, liberated slaves were disembarked from ships intercepted by the 
British. In both the Seychelles and multi-ethnic Mauritius, English is the 
official language but creole, the spoken language of all, is gaining ground 
and prestige. In Mauritius there is still some influence from French, via the 
wealthier Francophone community, mainly of European origin, and pri¬ 
vate educational establishments. 
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(iv) Louisiana 

French varieties of Louisiana also can be divided into dialects and creoles 
(leaving aside the approximation to standard French that was once used by 
plantation owners). The cajun dialects are derived ultimately from the lan¬ 
guage of those immigrants who eventually ended up in western Louisiana 
after their expulsion from Acadie (now part of New Brunswick and Nova 
Scotia) by the British in 1755, supplemented by Francophone settlers from 
the Mississippi valley. Modem speakers include other ethnic origins, including 
German immigrants and Amerindians, while some of the more prosperous 
descendants of original cajuns identified with English-speaking plantation 
owners in the nineteenth century. Cajun shares many of the features of mod¬ 
em Acadian and other Canadian varieties (see 8.7). It is usually contrasted 
with ‘Negro-French’, negre, characterized jocularly as mo gain (French moi 
gagne) ‘I have’, or mo court mo vini ‘I’m going to come’ (French moi courir 
moi venir). It is reported that this creole variety could still be heard on the 
streets of New Orleans in the prewar period. Nowadays no sharp contrast 
between cajun and creole can be drawn, and doubt is not absent about 
whether they were ever clearly differentiated. Creole characteristics may have 
been reinforced in some localities by the influx of refugees from Haiti during 
the slave revolts, when masters brought some of their slaves with them, but 
there is evidence of the existence of Louisianan creole before then. 

Black Louisianans nowadays show preference for English, and indeed 
many are descended from the English-speaking slaves brought in from the 
north during the early nineteenth century to work the sugar plantations. 
However, in the western bayou and prairie regions in which French sur¬ 
vives, somewhat precariously, sharp differentiation between the language 
of blacks and whites is not evident. Some localities, like Lafayette or 
Avoyelles, use more cajun-Mke varieties, others, like St Martin, use a fairly 
conservative creole that has nevertheless Haitian-like features. Where it is 
claimed that there are two distinguishable varieties, as at Vacherie (St 
James), the cajun tends to be penetrated by creole, with frequent code¬ 
switching and bilectal exchange, even within the family circle. Since the 
mid-nineteenth century, when cajuns tended to be treated as ‘white trash’, 
there has been some sense of identifieation of all loeal French varieties 
with under-privileged social status. The distinction between ‘white’ and 
‘black’ French can well have been exaggerated in the post-Civil War 
accounts that we have, which not infrequently have racialist overtones. 
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In cases like these it is often claimed that two originally quite distinct 
languages have with time intertwined; the dialect has ‘creolized’ and/or the 
creole has ‘decreolized’. Incidentally, attempts to introduce standard 
French into the Francophone regions of Louisiana, through the COD- 
OFIL programme, have not been conspicuously successful, especially as 
speakers of the local varieties find the language taught almost as alien as 
English. At present there has been instituted a major research programme, 
under the direction of Albert Valdman, to investigate Louisianan French, 
in particular cajm varieties, which seem to be in danger of extinction, and 
about which we have little firm information. 

5.8 CREOLIZATION AS CREATION, OR AS 
CATASTROPHIC CHANGE? 

Identification of a creole is intrinsically bound up with ideas about cre- 
olization. The most prevalent theory about the way creoles are formed is 
that they are mixed languages that draw their lexical substance from one 
language - in our examples a Romance language - and their grammatical 
structure from another. One difficulty with this theory is often in identifying 
the source of the grammatical structures, and in tying together the linguis¬ 
tic and the historical evidence. Creoles are often also seen as originating in 
simplified ‘babytalk’ used in limited exchanges between speakers with 
mutually unintelligible languages. The way in which such ‘pidgins’ can in 
modern times become fully fledged languages is matter for the develop¬ 
mental psycholinguist, but we have little evidence about such processes in 
the past. The creoles that arose during the slave-trade period are thought 
by some to result from relexification of the Portuguese slavers’ pidgin, by 
the simple replacement of Portuguese items by those of another language. 
This would account for the surprising similarity of structure between 
many of the relevant creoles. 

The most interesting recent hypothesis, by Derek Bickerton, that seeks 
to account for such similarities suggests that, when children have to create 
a language from incomplete and distorted linguistic data, in conditions 
which preclude the more normal language acquisition processes, a lan¬ 
guage ‘bioprogramme’ is at work. In the slavery context, it is assumed, the 
children would be exposed to ad hoc pidgins composed of lexical items 
culled from the master’s repertoire and used by adults who shared no com¬ 
mon language. With no normal access to a fully functioning natural Ian- 
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guage, the children will use their innate language capacities to create for 
themselves a new language, using the lexical material at hand. 

In this perspective, a true creole would be the one that adhered most close¬ 
ly to the bioprogramme and had least input from the master’s language. 
Communities descended from runaway slaves (maroons) would be more 
likely to have more authentic creole features. None of the Romance creoles 
fits well into this pattern (though for Angolar and Saramaccan see 2.3). Yet 
some varieties are more ‘imperfectly creolized’ than others. The creoles of 
Haiti and Sao Tome are more radical than those of Reunion or Louisiana, 
not because the African input is greater, but because contact with the model 
language was less intense. The distinction between a dialect and a creole 
could similarly depend on the degree of continuity of transmission from one 
generation to the next: a creole would be a new creation, whereas a dialect 
would be the result of an evolution, a handing down of tradition. 

Another view of creolization, much favoured by Romanists, is that it is 
an extreme and rapid version of linguistic change, brought about in social 
conditions in which there was a break in continuity of transmission. 
Simplification of inflectional morphology, loss of redundant distinctions, 
like that of gender, reliance on word order to signal grammatical relations 
and use of analytic constructions are apparently merely extensions of ten¬ 
dencies that are observable in the history of all the Romance languages. 
Some languages, like French, have gone further along the line; others, like 
Rumanian, have held back. In this view the creoles have rushed forward at 
a breakneck pace, some even faster than others. There would therefore be 
no qualitative difference between a dialect and creole, merely a quantita¬ 
tive one, which would depend on such things as difference in social condi¬ 
tions, geographical position, language contacts and so on. Creoles would 
take their place towards the bottom of a continuum of Romanceness 
which would include at different points on the scale the rest of the 
Romance family. 

5.9 CONCLUSION 

The answer to the question that we posed in this chapter must be an inde¬ 
cisive ‘It depends what you mean . . . ’ The Romance languages, dialects 
and creoles can be seen, from one perspective, as an unbroken continuum, 
in which certain sectors are brought into focus by standardization or 
koineization for particular social purposes. 
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In the next two chapters we shall examine how some of the linguistic fea¬ 
tures that differentiate one sector of the geographical continuum from 
another could have arisen over time. In the final chapter we shall glance at 
the social factors that add other dimensions to the relationships of variants. 


FURTHER READING 

The works already mentioned in the Further Reading for earlier chapters give 
ample bibliography to supplement this chapter: consult especially Harris & 
Vincent (1988) for a succinct description of the main languages and of the 
creoles, and Posner & Green (1980-93), vols. ii, ill and iv for dialectology, stan- 
dardology, ‘smaller languages’, overseas Romance and creoles. On classifica¬ 
tion the most recent substantial work is Manczak (1991); Embleton (1986) 
gives an up-to-date account of lexicochronological findings for Romance. On 
dialectometry, Goebl (1984), and on dialect variation, Lang (1982), Sankoff 
(1986), Fisiak (1988), Roberge & Vinet (1989). Beninca ed. (1989). On 
Romance typology, see Manoliu-Manea (1985), Raible ed. (1989), Komer 
(1987). The most general recent synchronic or diachronic linguistic works on 
individual languages include the following: on Catalan, Hualde (1992); on 
French, Jones (in press). Lodge (1993), Battye & Hintze (1992); on Italian, 
Lepschy & Lepschy (1989), MuljacSic (1991), Maiden (1995); on Portuguese, 
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When did the Romance 
languages differentiate? 


6.1 INTRODUCTION 

The question of the split of a uniform Latin into several distinct languages 
has always formed the main subject matter of Romance historical linguistics. 
Older thinkers, like Dante (see page 2), took for granted that languages, like 
all things human, change and decay, and saw the differentiation as an 
inevitable consequence of the passage of time. The geographical spread of 
Latin over a large Empire was also seen as causing inexorable disintegration 
and disruption of language unity; thus the seeds of development of each 
Romance language were there from the beginning in each region, but masked 
by the apparent steadfastness of the written language. Classic nineteenth- 
century Romance philology looked mainly for the sources of differentiation 
in the Dark Ages, between the administrative breakup of the Empire and the 
appearance of the first Romance texts at the start of our millennium. In more 
modem times there have been suggestions that the differences stemmed not 
so much from negative forces - like organizational chaos, breakdown of com¬ 
munications, terrorization by marauders - but from relatively positive social 
and cultural developments which affected some areas earlier than others - 
like Christianity’s solicitude for the salvation of the humble and the growth 
of concern for individual welfare. It is even possible that the perception of 
gulfs between the languages may date from much later, possibly not emerging 
into popular consciousness until the Renaissance period, hand in hand with 
the idea of the modem state and political society. 

6.2 EARLY DIALECTALIZATION? 

But what degree of differentiation was there in Latin before it spread 
beyond the Italian peninsula? In 3.2 we saw that it has been claimed that 


227 



228 


The differences 


the Romance languages derive from a sociolect of Latin, or from a related 
Italic dialect, rather than from the language of the classical texts. Some see 
the language of Rome itself as a hybrid between different Italic dialects, 
and as it was imposed, by about 270 BC, on the rest of the peninsula, over¬ 
laying numerous other tongues, it must, in its spoken form, have acquired 
a degree of flexibility. But we have comparatively little evidence for dialec- 
talization of Latin within the peninsula, and we are unsure how long the 
other languages continued in use. 

(a) The influence of Greek 

The literary language of the Golden Age, however, is thought to have been 
profoundly influenced by Greek, to which it looked for prestige and style. 
The Greek-orientated education of the Roman aristocratic classes could 
not fail to penetrate the high style of serious literature and especially poet¬ 
ry. We have already mentioned the possibility that the metrical use of 
quantity, rather than stress, was not so much a native Latin development 
as an Imitation of Greek, and borrowings from Greek into the literary lan¬ 
guage are numerous. Grammatical writings are also clearly inspired by 
Greek grammarians; one has the impression, though, that the more 
methodical and disciplined Romans were readier to codify and conform to 
a standard usage than their Greek subjects. One effect of these tendencies 
may have been greater sociolectal differentiation, as the spoken language 
developed untrammelled by the rigid requirements of high style. 

Influence from Greek came also from another direction, in the shape of 
the numerous Greek slaves, who were among the first to adopt 
Christianity. Some Greeks, as freedmen, also acquired considerable 
wealth and influence in the later period. Such Romance developments as 
the creation of a definite article or of the compound perfect are often 
linked to the impact of Greek on later spoken Latin. 

(b) Social differences 

That there were differences in language use among different sectors of the 
Latin-speaking population is not doubted: we have enough direct and 
indirect testimony from classical times. What is in doubt is how far such 
differences account for divergence between the Romance languages. In his 
classic study of Romance differentiation, Wartburg (see 5.2) distinguished 
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eastern and western areas, with the dividing line drawn across Italy from 
La Spezia to Rimini. One hypothesis he advanced was that the eastern part 
of the Latin-speaking Empire continues a lower-class variety, whereas the 
Western Empire was more subject to the linguistically conservative influ¬ 
ence of schooling. In most of the eastern part Romance has not survived 
to the present day; the evidence adduced relates mainly to southern Italy 
and to Rumania, with some reference to Dalmatian. It is held that Dacia 
was settled by veterans from the legions, who would not be expected to use 
high-flown Latin style, while in the south of Italy a popular Latin may 
have survived unscathed since ancient times. 

One feature that Wartburg picks on in his east/west distinction is the 
treatment of Latin word-final -s. All the inscriptional and textual evidence 
points to the tendency of this sound to disappear in colloquial usage, and 
in early literary Latin it did not ‘make position’ (see 3.3) before a word- 
initial consonant - a feature that was regarded as rustic in classical times. 
It is usually assumed that in Late Latin -s was no longer pronounced in 
colloquial usage: the collapse of the Latin case system can be linked with 
the disappearance of the distinction between nominative and accusative 
inflections s/m. 

In Romance, however, the -s persisted (at least at first) in the west, here 
including Sardinia, while disappearing in the east, though after a stressed 
vowel it seems to have vocalized to [i] or [j]: 

(1) TRES ‘three’ > Rum. Irei, Dalmatian (Veglia) tra, Ital. tre (earlier tret)! Sard 
(Logudorian) tres, R-R (Engadine) tmis, French trois, Occ., Cat., Span., Ptg. 
tres. 

More plausible is a development that involves lengthening and diphthongiza- 
tion ([tre:] > [trei]). 

In the east the accusative plural forms for most nouns thus would be iden¬ 
tical with the singular, whereas in the west the [s] clearly marked the plural. 
The Rumanian and Italian plural forms are thus often assumed to be 
derived from the nominative rather than the accusative (see 3.3): 

(2) capre ‘goats’ < caprae / Sard (Logudorian) [kraCas], French chevres, Span. 
cobras < capras. 

However, in Italian (unlike Rumanian) this plural marker -e has not, unlike 
the regular reflex of Latin AE, provoked palatalization of preceding [k]: 

(3) amiche [amike]‘friends’ / Cesare < Caesare. 
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This suggests that it may have levelled from a diphthong [ai] < -as. 
Similarly the i which serves as masculine plural marker and the person 2 
verbal marker (and which in Rumanian is realized phonetically by palatal¬ 
ization of the preceding consonant) is more plausibly derived from -ES, 
rather than-I: 

(4) cani, ciini ‘dogs’ < canes; vedi, vezi ‘you see’ < vides. 

The word-final 5 of the western languages not only marked the nominal 
and pronominal plural and the person 2 verb endings, but in Old French 
and Occitan was the mainstay of the nominal case system (see 3.3). 
Nevertheless it has tended to weaken and disappear over time in many of 
the varieties. 

Even in Sard we find assimilation of -s before voiced consonants: 

(5) sar dentes ‘the teeth’, sa mmanos ‘the hands’ for sas dentesimanos. 

In Occitan and North Italian varieties the -s has frequently weakened to a 
palatal glide in similar conditions: 

(6) Gascony [eraj duoj rodos]’ the two wheels’ < illas duas rotas. 

In French word-final -s began to be lost in the twelfth or thirteenth centu¬ 
ry, and had virtually ceased to be pronounced by the seventeenth century. 
Today it is used only in liaison contexts, usually in tightly knit syntagmata: 

(7) les arbres ‘the trees’ [lezaBbK] / lespetits arbres ‘the little trees’ [leptizaBbK]; ils 
portent ‘they carry’ [ipoBt] / ils aiment ‘they love’ [izem]. 

Thus it can be argued that plural marking can be effected in speech by a mor¬ 
pheme-initial [z] marker, rather than by the word-flnal -s, indicated in the gra- 
phy. 

In Spanish, too, syllable-final s tends to open to [h] and sometimes to fall; 
there is evidence of this change from the sixteenth century. It is particular¬ 
ly noticeable in the south of the peninsula and in many American varieties, 
especially on the Paeific coast. Some trace of the marker of the plural is 
usually retained within the noun phrase, though not on every word: 

(8) los hombres ‘the men’ [lohombre], las manos ‘the hands’ [lahmano]. 

Elsewhere, however, as in eastern Andalusia, the s is completely lost and 
the burden of marking the plural falls on the preceding vowel: 

(9) boh [bolo] / bolos [bob] ‘skittle(s)’. 
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In some varieties of Colombia [s] is aspirated in other than word-final posi¬ 
tions (e.g. necesario [nehesario], una sefiora [unahejiora]). 

The glottalization of -s has been linked to the ‘Western’ palatalization 
(laminalization) to [J], which occurs sporadically in other positions: 

(10) Old Span, xabon ‘soap’< sapone, Ptg. baixo. Old Span, baxo ‘low’ < bassu 

(where x represents the spelling for [J], the modern Spanish [%] being writ¬ 
ten as f). Evidence for the standard European Portuguese palatal pronun¬ 
ciation of syllable-final -s dates back to the thirteenth century (though 
modern Galician uses an apical sibilant, and Brazilian, outside Rio, a dor¬ 
sal sibilant). 

6.3 SPREAD OF ROMAN RULE 

It seems obvious that as the Roman Empire, and its Western official lan¬ 
guage Latin, spread, regional variation would increase. Little reference to 
variation was made by contemporary commentators, and such remarks as 
have been preserved are ambiguous. Much effort by modern scholars has 
been devoted, without great return, to the meticulous study of ‘errors’ in 
late texts and imperial inscriptions from different areas, in the hope of 
uncovering information about such variation. Almost ineluctably, only 
phonological evidence is available; searches for early signs of some of the 
differentiating features of the languages we know - like voicing of intervo¬ 
calic consonants or merger of mid vowels - have so far yielded only meagre 
rewards. 

Yet vernacular texts from the early years of our millennium already 
show clear marks of phonological regional differentiation. The common 
assumption is that the Late Latin texts continued spelling conventions that 
masked pronunciation changes. Classic doctrine has it that two codes were 
current in the Romance area during the first millennium: one an approxi¬ 
mation to Latin, represented by the written texts and used by the educated 
for certain purposes, and another underground spoken vernacular in more 
general use, which, at first sporadically and then consistently, broke into 
the written form. 

Another view is that an evolved form of Latin, represented distantly by 
the texts, continued to be spoken and written until the Carolingian spelling 
reforms of the eighth century, which divorced the written from the spoken 
form and triggered experiments in writing the vernacular. A variation (see 
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3.5) on this view is that the ‘Latin’ texts were indeed conventional repre¬ 
sentations of the spoken vernacular until reforms of Latin pronunciation, 
under Charlemagne, re-invented a learned Latin that henceforth was in a 
diglossic relationship with the vernacular. 

The differences - especially phonological - between the different regions 
are often attributed to transference from the languages with which Latin 
was in contact. This is plausible enough; difficulties arise largely from 
chronological and sociolinguistic considerations, and from our paucity of 
knowledge about the source languages. One would expect that phonologi¬ 
cal transference would be accompanied by a considerable degree of lexical 
borrowing and morphosyntactical influence, but this is more difficult to 
trace. I shall look in the following sections at some of the Romance fea¬ 
tures attributed to language contact. A distinction can be made between 
substratum, superstratum and adstratum influence. 

The first type of interference - from the first language of a subjugated 
people who shift to their masters’ language (here Latin) - is so familiar that 
it needs no explanation. It has seemed obvious to most past Romanists 
that differentiation of Latin spoken over the Empire must owe a great deal 
to the languages of different pre-Romance populations. Unfortunately, 
most of these languages are now extinct, and our knowledge of them is 
defective. We can hazard guesses about possible phonetic interference but 
little else. There are also difficulties about the chronology of the interfer¬ 
ence phenomena. One would expect these to operate during a period of 
bilingualism, but to fade as the pre-Roman languages died out. We still 
have little knowledge about how quickly the populations embraced use of 
Latin (which they seem to have done fairly enthusiastically) or about 
regional variation in Imperial Latin. It does seem to be true that such vari¬ 
ation was so superficial that it did not impair communication across wide 
spatial areas, even after the institutional collapse of the Western Empire; 
hence scepticism among many modern Romanists about the validity of the 
substratum theories advanced. 

The second type of interference is more problematic. At the end of the 
Western Empire period, some Germanic-speaking ‘barbarians’, though 
dominant militarily, appear to have respected Roman culture enough to 
have adopted a form of Latin, so that today a Romance, not a Germanic, 
variety is spoken in, for instance, northern France, northern Italy and 
Spain. The hypothesis advanced, especially by German Romanists, is that 
the ‘Latin-speaking’ population adopted some of the features of the 
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‘faulty’ speech of their overlords, so differentiating their varieties from 
that of other Latin-speaking areas. We are on firmer ground here about 
the sort of features that may be affected, as the Germanic languages are 
well known to us - though we do not always know so much about the indi¬ 
vidual varieties involved. It is suggested in particular that Northern 
French shows interpenetration of many Germanic characteristics taken 
from the Franks, but we are uncertain about the Frankish variety, about 
the number of German speakers settled in the area, and indeed about how 
long they continued to speak a Germanic language. It has been claimed, 
for instance, that about half the Frankish loanwords in French date from 
the Imperial period. 

Other superstratum languages that may have had an influence on the 
development of post-imperial Romance varieties include Arabic - on the 
Iberian and South Italian varieties in particular (Romance seems to have 
been swept away from North Africa by the Arab incursions). Here lexical 
borrowing is evident, but other types of influence are more doubtful. The 
same may be said of Slavonic influence on Rumanian - though here it is 
difficult to distinguish between ‘superstratum’ and ‘adstratum’ influence. 

By the latter, we mean the influence of languages with which Romance 
is in contact geographically and socially. Again it is the lexicon that most 
clearly shows such influence, but other levels may also be affected. 
Rumanian, surrounded by Slavonic, Hungarian, Greek and Albanian, and 
part of the Ottoman Empire for a long time, was particularly subject to 
such influence. Elsewhere in Europe adstratum influence is detected in the 
Romance varieties along the frontiers with Germanic (though in modern 
times there seems to have been strong resistance to such interpenetration 
of features). Linguistic isolates, like the Rumanian of the Croatian Istrian 
peninsula, of Macedonia or of Albania, the Judeo-Spanish of the Near 
East, or the Romance varieties in southern Italian Greek-speaking areas, 
almost inevitably show contact influences. Non-European Romance vari¬ 
eties, in the New World, in Africa and Asia, can be seen as particularly 
under siege from other languages - at the extreme, creoles may be viewed as 
‘non-Romance’ from some points of view (see 2.2). 

I shall reserve further discussion of adstrata for chapter 8. In the next 
sections I shall review some of the main hypotheses about how substratum 
interference may have caused differentiation in the early history of 
Romance, first in phonology and then in vocabulary. In 6.4 we shall turn to 
the question of superstratum influence. 
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(a) Phonology 

(i) Celtic lenition 

One of the criterial features for Wartburg’s division of the Romance area is 
the lenition in the west of Latin intervocalic consonants, which regularly 
remain intact in the east. The voicing and weakening of intervocalic con¬ 
sonants can be seen as a ‘natural’ assimilatory process, with the voiced and 
open qualities of the surrounding vowels spreading to the intervening con¬ 
sonant; note that following voiced /r/ or /!/ has the same effect as a vowel 
on a preceding consonant. Some maintain that voicing was a (phonetic?) 
feature of some Latin varieties from early on, and some evidence is found 
in Pompeian graffiti - in an area where today intervocalic voicing is not 
found. 

It is possible that lenition was a general Indo-European tendency that 
surfaced most conspicuously in Celtic. Perhaps voice was not a distinctive 
feature in the Latin consonantal system, for which a tense/lax distinction 
was more fundamental. The opposite tendency to fortition, lengthening or 
doubling intervocalic consonants, was generally regarded as a popular 
expressive means of securing emphasis; it is found in Southern Italian 
varieties today (see below). 

In the Western Romance languages word-initial consonants were usual¬ 
ly treated like geminates: a link could be made with the postulated early 
Latin word-initial stress. 

Spanish reflexes of initial L [1] and N [n] (rather than [X] and [ji], form an excep¬ 
tion. In Catalan, on the other hand, initial L does become [/(], like the intervo¬ 
calic geminate; initial N gives [n], whereas nn > [ji] in the standard (but [n] in 
many dialects). 

In intervocalic position geminates are, however, normally simplified and 
voiceless obstruents are voiced, while voiced obstruents tend to open to 
fricatives (and in some cases eventually to fall): 

Intervocalic voiceless obstruents in Romance 

rTpa ‘riverbank’: Rum. ripd, Ital. ripa. Cat. (older), Ptg. riba. Span. [rriBa], French rive', 

VITA ‘life’: Rum. vita (’livestock’), Ital. vita. Cat. [vido], Ptg. [vidn]. Span, [bida], French vie; 
AMIGA ‘female friend’: Ital. arnica. Cat. [amiyo], Ptg. amiga. Span, [amiya], French amie. 

In Tuscan, and to a lesser extent in standard Italian, there is sporadic (usually 
lexically determined) voicing: this is particularly found with sibilants (for 
instance, in the suffix -ese). The voicing is sometimes explained in terms of 
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dialect borrowing (in northern Tuscany it is more frequently found) or of 
accent placement (occurring more frequently after the accent, as in Jegato 
‘liver’ < FICATU). However, the words that are affected do not fall within any 
obvious category, semantic or stylistic, and it has been suggested that in 
Tuscan the process can be related to the spirantization known as gorgia toscana 
(see below for ‘Etruscan spirantization’). 

Voicing or lenition has been described in terms of a chain shift, triggered 
initially in the Late Latin era by the simplification of geminates. It must be 
assumed in this scenario that the Rumanian loss of geminates dates from a 
later period. 

Geminate obstruents in Romance 

cOppa (alongside cupa) ‘cup’: Rum cupd, Ital. coppa. Span., Cat., Ptg., copa, French coupe; 
gOtta ‘drop’: Rum. gutd (‘gout’, perhaps from French), Ital. gotta. Span., Ptg., Cat. gota, 
French goutte [gut]; 

bOcca ‘mouth’: Rum. bucd (‘cheek’), Ital. bocca, Span., Cat., Ptg. boca, French bouche [buj]. 


The lenition process has also been ascribed to Celtic substratum influ¬ 
ence, even though word-initial consonants in intervocalic sentence posi¬ 
tion are not usually affected, in the way that they are in Celtic. 

Intervocalic sonorants are not usually affected by lenition. In Portuguese, 
however, n and l are lost: 

Portuguese loss of N and L 


MANU 

mao 

MALU 

mau 

SONU 

som [6o] 

SOLU 

so 

PANE 

pao 

PANNU 

pano 

SALIRE 

sair 

CABALLU 

cavalo 


This loss is attested in the earliest texts (though in the case of n the graphical 
tilde may have been an abbreviation for a pronounced nasal consonant, rather 
than denoting nasalization of the vowel). 

The Celtic substratum theory is eroded somewhat by the presence of some 
of the more conservative Sard varieties in the Western ‘leniting’ group, 
even though there is no evidence of Celtic populations in the island. 
Though some central varieties, like those of Bitti and Nuoro, do retain 
Latin intervocalic voiceless consonants, this is not so of most dialects. 
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Perhaps lenition here is of more recent origin. Indeed in modern 
Logudorian and Campidanian Sard varieties, as in south-central Italy 
(including the Roman dialect), it seems that a productive lenition rule 
operates, by which voiceless consonants tend to voice and open in all inter¬ 
vocalic positions, including in word-initial position. 

In the Italian dialects affected, the consonants may be lengthened (or 
doubled) especially in word-initial position, perhaps as a reaction against 
lenition. A process in the standard language known as ‘syntactic reinforce¬ 
ment’ or ‘doubling’ {rafforzamento or radoppiamento) ensures the protec¬ 
tion of word-initial consonants from lenition, especially after a stressed 
vowel and certain prepositions. Sometimes the gemination is lexicalized 
and represented in the orthography: 

(11) davvero ‘truly’, soprattutto ‘above all, especially’,/ra/tonto ‘meanwhile’, addio 
‘farewell’, neppure ‘yet’, diciassette ‘seventeen’. 

In Other cases, it is more or less obligatory in pronunciation: 

(12) Perche no?[mi\ ‘Why not?’. Cost fu [IT] ‘Thus it was’, Piu volte [w] ‘More 
times’. Dove set? [ss] ‘Where are you?’, come mai? [mm] ‘how on earth?’ 

The doubling can be distinctive in some cases: 

(13) lodato [lo.dato] ‘praised’ / L’ho data [lod.dato] ‘I have given it’. 

Northern speakers, whose dialects fall within the ‘western’ area, do not habitu¬ 
ally use double consonants in colloquial use of standard Italian and may omit 
to reinforce in this way. 

Explanations for doubling include the suggestion that it originates as con¬ 
sonant assimilation, with loss of a final consonant: 

(14) [am.me] ‘to me’< AD me, [soprat.tutto] ‘above all’< SUPRA-ad tottu. 

This sort of gemination seems to be regular in many Southern Italian 
dialects. Another suggestion is that the lengthening is effected by the pre¬ 
ceding accent. In some cases, as we have seen earlier, a stressed vowel is 
lengthened, and sometimes diphthongized: 

(15) [laito] < LATU; nuovo < NfSuu. 

However, in proparoxytones, the vowel lengthening is less evident in the 
stressed syllable, and sometimes, regularly in some dialects, there is conso¬ 
nant gemination instead: 

(16) pecora (not *piecora) ‘sheep’, but mdcchina < MACHINA. 
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There are examples of gemination even in some paroxytones, lending credence 
to this theory: 

(17) succolsugo ‘juice’< sucu; briitto ‘ugly’< brutu. 

We do not know whether the sort of lenition/fortition found in southern 
Italy and Sicily dates back to Proto-Romance, or whether it is a modem 
development. What it seems to show is some sort of neutralization of the 
length and voice distinctions in consonants. Sometimes there is alternation 
between devoicing and gemination: 

(18) Salento [krassu] ‘fat’ / [kkjuggrassu] ‘fatter’ (Ital. (piu) grasso), [tet/i] ‘ten’ / 
[kuddetji] ‘with ten’ (Ital. (con) died)- 

‘Western’ lenition, on the other hand, is envisaged as a chain shift, which 
avoids merger between long and short consonants, even at the risk of 
merging voiced and voiceless counterparts (and, at the extreme, of elimi¬ 
nating them altogether from the relevant lexical forms). 

One problem about postulating an east/west dichotomy on the basis of 
voicing is that in dialects straddling the central Pyrenees (Aragon on the 
Spanish side and Beam on the French) Latin voiceless consonants are reg¬ 
ularly retained. Some suggest that these conservative varieties resisted 
Western innovation, whereas others see the influence of Basque in a postu¬ 
lated early merger of geminates with single consonants, which precluded 
the chain shift. 

(ii) Celtic fronting 

Another feature that has been attributed to Celtic substratum influence is 
French front rounded [y], as the regular context-free reflex of Latin u. It is 
assumed that the Latin sound was a high back rounded vowel [u], like the 
reflexes in all the other major languages. 

We note, however, that Greek u ([y]) in some (early?) loanwords into Latin 
appears with the graphy u : short C, as shown by later Romance developments: 
e.g. 

(19) BURSA > Ital. borsa. Span., Ptg. bolsa. 

Elsewhere the accepted graphy (y) and pronunciation imitated the Greek, but 
in colloquial speech [i] was presumably pronounced: 


( 20 ) 


GYRARE > Ital. girare, French girer. 
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It is just possible that Latin u was in some varieties pronounced as [y], but in 
others as [u]. One could then conceivably view the [y] pronunciation as a fea¬ 
ture that persisted from Latin to some Romance varieties. 

The [y] pronunciation is not confined to Northern French, but is wide¬ 
spread in Occitan and North Italian dialects, as well as in some Portuguese 
varieties. It is absent from some Northern French varieties, in particular 
Belgian Eastern Walloon and some FrancoprovenQal dialects, where it is 
assumed that there was regression to [u] under the influence of neighbour¬ 
ing Germanic dialects. However, the existence of [y] pronunciations in 
some Germanic varieties has led some to see it as an areal feature and even 
to postulate a connection with genetic factors. 

The connection with Celtic is somewhat tenuous, especially as we know lit¬ 
tle about Gaulish Celtic. The evidence adduced points to Latin loanwords 
into insular Celtic (e.g. Welsh) in which u appears as [i]. Those who uphold 
the Celtic substratum hypothesis maintain that Gaulish continued to be spo¬ 
ken for some considerable time under Roman occupation, and that interfer¬ 
ence from the subject language produced a regional pronunciation in 
Gallo-Roman. We have little hard evidence about the persistence of Gaulish, 
nor indeed of the widespread use of Celtic throughout Roman Gaul. 

Breton, we recall, is generally thought to have been imported from Britain in 
the fifth century, although some do believe that it may also have some connec¬ 
tion with a Gaulish Celtic that may have survived in the far west. 

Moreover, we are uncertain about the chronology of the fronting of [u] to 
[y]. As in French the graphy u was always used, and as there was no con¬ 
textual differentiation, we have little information about pronunciation 
before sixteenth-century commentaries, which sometimes call it ‘Gaulish 
u’ and liken it to the Greek, German and Scottish sounds, commenting on 
its affinity with [i]. In the sixteenth and seventeenth centuries there was fre¬ 
quent confusion with eu, which was still diphthongal in some varieties in 
the earlier period, but then levelled to [ 0 ] in the standard. 

In English, French loanwords are pronounced with [ju]: 

(21) pure < pur. 

The English grammarian Palsgrave (1530) still likens French u to the 
sound in mew, though sixteenth-century native-speaker teachers of French 
to the English condemn this pronunciation, emphasizing that the lips 
should be rounded, without movement of the tongue. It is usually thought 
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that the [ju] always represents an English mispronunciation, with fission at 
segmental level, delinking of the labial element from the palatal. It is pos¬ 
sible, though, that French u was originally diphthongal (in tonic free sylla¬ 
bles but later spreading to pretonic and to blocked tonic syllables?). It 
would then have levelled to [y] in the same way as [eu] > [ 0 ]. 

The differential development of pulice (> pul’ce with syncope) ‘flea’, in 
French {puce) and Old Occitan (piuce) suggests that the French, and not the 
Occitan, fronting may have taken place after the vocalization of preconsonan- 
tal [1] to [u], thus dating it to after the twelfth century. However, if the French 
graphy u did at one time represent [ju] there would be no significant difference 
from the Occitan form. Besides, in the preconsonantal sequence [il] in e.g.fils < 
FiLius, the / fell, so perhaps the same was true in the sequence [ylj. Note that 
Norse loanwords, presumably dating from the tenth century, also show the 
fronting, and that [y] did not trigger at an early period palatalization of preced¬ 
ing [kj. 

One postulated effect of the fronting is that it triggered a ‘drag’ chain shift 
by which Gallo-Roman [o] was raised to [u] whenever it was not diphthon¬ 
gized : 

(22) Crsu > ours, ’*tOttu > tout. 

The graphy u is found in eleventh- and twelfth-century texts, ou being 
introduced in the twelfth century (presumably after diphthongal [ou] had 
differentiated to [eu]). Another suggestion (in 1949, by Haudricourt & 
Juilland, who eompare similar changes in Greek, Swedish and Portuguese) 
is that the fronting was conditioned by the existence of four degrees of 
aperture in the vowel system, which placed a burden on the back series 
(owing to the asymmetry in the articulatory space available at the front 
and at the back of the mouth). The fronting of the high back vowel is part 
of a ‘push’ chain shift that originated in the greater differentiation between 
[ 0 ] and [o], resulting in [o] raising to the [u] slot. 

Another effect of vowel fronting in Northern French is less often 
ascribed to Celtic influence, but deserves mention here. In French (but not 
the more northern dialects) Latin velar consonants palatalize before A 
(and also before AU, even though this eventually levels to [o:]). This wave of 
palatalization is not found in other Romance languages, except Western 
Rhaeto-Romance varieties, and has a different result from the ‘pan- 
Romance’ palatalization of velars (see 3.3). In French the velar + A palatal¬ 
ization yielded [f] and [(^] (or perhaps [c], [^], usually written ch, j). 
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becoming [J], [ 3 ], by loss of the delayed release feature, in the thirteenth 
century: 

(23) CANTARE > chanter [Jate] ‘to sing’; gamba > jambe [sab] ‘leg’. 

(Compare the dental affricate, usually written c or z, resulting from a 
voiceless velar + E, I.) 

In Francoprovenpal varieties dental affricates result from both waves of 
palatalization. In French, German and Arabic loanwords in velar + e, i partici¬ 
pate in the second wave of palatalization: 

(24) *skina > echine ‘spine’; meskin > (Old) meschin ‘young man’, with [tf] > [f] 

(cf Occ. esquina, Ital. meschino, both with [k], the latter borrowed into French 
in the seventeenth century as mesquin ‘shabby, paltry’). 

The second wave of palatalization seems to imply that French a at the time 
was a front vowel, whereas in the other languages its counterpart was a 
mid or back vowel. In tonic free syllables French a gave e by the time of the 
earliest texts, perhaps by diphthongization via [ae] or [aj] (see below). 

The affinity of French [a] to [e] was still stressed by sixteenth-century commen¬ 
tators. By the eighteenth century there is evidence, however, that lengthened 
[ai] had backed to [a], and the distinction is still maintained in some varieties: 

(25) patte [pat] ‘paw’ / pate [pat] ‘paste, pastry’. 

Although accepting that there must have been fronting of a to give the 
necessary phonetic conditions for palatalization in French, Haudricourt & 
Juilland maintain that the sufficient condition for the phonological change 
lies elsewhere. They point to the resolution in French of the Latin QU [kw] 
before a: 

(26) quand\)/i] / Ital. quando. Span, cuando [kw] ‘when’ 

as provoking a push-chain shift that resulted in the palatalization of c ([k]) 
in the same environment. It is, however, uncertain when the [w] element 
was lost in [kw]: in aqua ‘water’, for instance, traces remained for a long 
time (see Old French eve, eue, which eventually yields eaue, modern eau 
[o]). Certainly secondary [kw], like that of coagulare > cailler ‘curdle’, 
seems to behave in the same way as Latin QU. In modern French some 
words in qua- can still be pronounced with [k] or [kw]: these are usually 
Latinisms, like the thirteenth-century borrowing quadruple. Some six¬ 
teenth-century spelling reformers, however, advocate the omission of u in 
words usually spelt with qu. 
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Perhaps we should look elsewhere for the causes of the phonologization of the 
allophonic variation [f]+ a / [k]+ o. The levelling of the au diphthong, proba¬ 
bly to [o:], is attested in the graphy of the earliest texts, though these provide no 
evidence for palatalization (e.g. cosa < CAUSA), and probably by this period 
there had been raising or diphthongization of inherited central back vowels, so 
that the earlier allophony was no longer transparent. Moreover, the influx of 
loanwords in ca- began early (e.g., from Occitan in the twelfth century, cabrer 
‘to rear up’, and cadeau ‘gift, present’, originally meaning ‘capital letter’) and it 
was soon to become a flood. This may be a sign that [tf] was no longer at that 
time analysed as an allophone of [k]: the sequence [tfe], we recall, had become 
current with the fronting of tonic free a (as in chez < casa), though the sequence 
[tfi] was not well established at an early date. 


(iii) Etruscan spirantization 

I have already mentioned the localized Tuscan spirantization of intervo¬ 
calic consonants, which some believe to be in some way related to the leni- 
tion of more northerly Italian dialects. Its geographical distribution has 
prompted some to make a link with the extinct (and as yet undeciphered) 
Etruscan language - even though much of the area known to have been 
inhabited by Etruscans in classical times does not participate in spirantiza¬ 
tion. Usually the relative conservatism of Tuscan dialects, which form the 
basis of the Italian standard, is ascribed to the local population having 
learned Latin as a totally foreign language, and thus being less apt to intro¬ 
duce colloquial elements from their own tongue. Today, though, in Tuscan 
intervocalic voiceless plosives are pronounced as spirants, a feature that 
has acquired a degree of prestige. The process affects Ikl in particular, 
which is pronounced as [h] (which falls altogether in Lucca and Livorno): 

(27) dico [diho], poco [poho], la casa [lahaisa]. 

The sounds /p/ and It! are also affected in some dialects: thus rapa is pro¬ 
nounced [rap^a] or [ra^a], and stato as [staUo], [sta0o] or [staho]. In rapid 
Florentine speech all three voiceless plosives may be dropped altogether. 

Spirantization of Ikl is attested since the sixteenth century (for instance, 
in the spelling of the surname Cecherelli as Ceheregli), though this does 
not necessarily mean it was not current in popular pronunciation earlier. It 
has been claimed that Dante’s remark about the ‘turpitude’ of Tuscan 
speech refers to this unprestigious pronunciation. Some also believe the 
early spelling ch for [k] may signal a pronunciation [%] (but we recall that ch 



242 


The differences 


is normally used in Italian before e and i). In any case, although it is 
believed that Etruscan consonants were aspirated, there is no evidence of 
spirantization, nor of differential treatment of intervocalics in that lan¬ 
guage. Nevertheless the Etruscan substratum hypothesis still has advo¬ 
cates. 

(iv) Spanish /f/ > /h/ 

Equally speculative, and now almost wholly discredited, is the association 
with an Iberian substratum of the Castilian change of Latin f to [h], near¬ 
ly always, though not exclusively, in word-initial position. This change 
appears to have been originally confined to the region of Old Castile, but 
spread with this dialect to the standard. In the modern language the [h] 
has disappeared but it is still marked in the graphy: 

(28) FERIRE > herir ‘to injure’, facies > haz ‘surface’, fundu > Hondo ‘deep’. 

Although the modern standard pronunciation of/is a labial dental frica¬ 
tive, a regional pronunciation [4)] is frequent and possibly original. It has 
been suggested that this sound had the features [continuant], with weak 
approximation, and [round], and can be realized in some dialects sequen¬ 
tially rather than simultaneously, as [h*], as in some varieties today: 

(29) Peruvian Amazon juacil for fdcil, also Fan for Juan. 

Some credence is lent to this suggestion by the fact that in standard 
Castilian the change does not occur before a diphthong: 

(30) festa > fiesta ‘feast’, focus > fuego ‘fire’. 

In these cases the delinking of the [round] feature may have been inhibited 
by its not being able to form a diphthong with the following nucleus. In 
some dialects, though, the feature is lost before a rounded vowel: 

(31) juegoior fuego. 

Perhaps in the Castilian change, too, the split [round] feature was lost first 
before a round vowel, with subsequent generalization of its loss. 

In some older graphies it seems that/may have represented [h] whereas ff was 
used for [f|: the graphy h was introduced only in the late fifteenth century. 

It is quite possible that the feature splitting occurs most frequently in the 
acquired speech of those whose first language has no [4)] sound: this seems 
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to be true of Latin American varieties. That Basque does not have an /f/ 
phoneme is adduced as evidence that this was also so in pre-Roman 
Iberian (though in Basque, Latin f is usually represented in loanwords as 
[b] and not as [h'^). It has also been pointed out that the F > [h] change is 
attested in some Southern Italian dialects, where Iberian influence is 
implausible, and indeed it is not unusual for [(j)] to open to [h]. Moreover, 
as some later loanwords are affected, it is assumed by some that the change 
occurred long after Iberian had ceased to be spoken. It has been suggested 
that the Spanish phonemic split can be dated as late as the twelfth or thir¬ 
teenth centuries, when there was an influx of French and Occitan speakers 
into Spanish territory. 

(v) Rumanian ‘vowel centralization’ 

Rumanian has a neutral central vowel a [n] or [o] < Latin unaccented A, a 
development found in many Romance varieties. In Rumanian (as in 
European but not Brazilian Portuguese) it is also found in some accented 
positions: 

(32) par ‘pear-tree’< piru, Jan ‘lands’ < terrae. 

There is also another central unrounded front vowel, i (also written as a in 
older graphy and in certain contexts today). This is sometimes described as 
a high central vowel [i] or as a fronted back vowel [ui]. In some words the 
two central vowels are interchangeable, with colloquial speech preferring 
the former, but some minimal pairs testify to their distinct phonemic sta¬ 
tus: 

(33) vdr ‘cousin’ (yer in dialects south of the Danube) < (consobrinus) vErus / vir 
‘I insert’, of Slavonic origin. 

The provenance of the f vowel seems to be due to a whole range of more or 
less sporadic processes. These are said to include the raising and central¬ 
ization of non-high Latin vowels before a nasal, and the retraction of front 
vowels following so-called ‘velarizing’ consonants, especially /r:/: 

(34) Itnd ‘woor< lana (see Ptg. la [1b]), ripd ‘ravine’ < rIpa. 

Some seek the origin of i in Slavonic influence, as it occurs in a number of 
Slavonic loanwords. Others prefer to see the ‘velarizing’ tendency of 
Rumanian as stemming from the pre-Roman Thraco-Dacian substratum. 
There is little evidence to support this hypothesis, however, and it may be 
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preferable to view the central vowels as defaults to which other vowels 
revert when their context induces loss of distinctive features. 

What may also be possible is that d and t continue a former short [a] and 
[i], whereas modern [a] and [i] represent former long vowels. These latter 
could have resulted from coalescence of two like vowels: 

(35) casd ‘house’ / casa ‘the house’, where the latter incorporates definite article [I\a 
(see Ptg. casa (azut) [kazn] / [kazazul]); Cintarn ‘We sing’ / Cmlam ‘We were 
singing’ / CANTAMUS, CANTA[b]amus (see European Ptg. present tense ending 
[nmuj] < -AMUS / past tense ending [amuj] < -AVIMUS). 

In Other cases the long vowels could have resulted from lengthening in 
tonic syllables, a process that may have been inhibited by a following nasal 
or in some cases a preceding labial, sibilant or long r. The metaphonic 
effect of a following palatal glide seems sporadically to have provoked the 
change a>d: 

(36) mare sg. / mdri pi. ‘sea(s)’ < mare(s) (but marelmari ‘big’ < ?(Mas), mare(s)) 

Chronologically, spatially and morphologically there is some alter¬ 
nation between i and f, which is not incompatible with the view that the 
former was a longer version of the latter: 

(37) Old j// modem jt ‘and’ < sic; dial. :tc / standard zic ‘I say’ < dIco; vmdhme 
‘vein(s)’ < vena(e), vmd/vinde ‘I/he sell(s)’ < vendo/vendit. 

The fact that the r had an effect on retraction of a front vowel may indicate 
that it once had retroflex or perhaps uvular articulation (as it has today in 
some varieties, allegedly influenced by French). For the genesis of uvular r in 
other Romance languages see below. 


(b) Borrowing from substratum languages 

The proportion of any Romance lexicon that can be firmly derived from 
pre-Roman languages is very small, though some scholars have a tendency 
to ascribe a pre-Roman etymon to any word of unknown origin. Celtic sup¬ 
plies a number of common words to the Western languages: some of them 
may have already been borrowed into Latin (like the word for ‘cream’, 
CRAMA, which survives in France and north Italy). Most, however, are not 
attested in texts before the twelfth century at the earliest. Occitan seems to 
have retained the largest stock of allegedly Celtic words: clot ‘pit, grave’, 
lausa ‘roof slate’, comba ‘valley’, for instance, which also are known to 
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Catalan; borne ‘milestone’, caillou ‘pebble’ seem to be confined to the north 
of France, but cognates of boue ‘mud’ are found in the Alps, of creux ‘hol¬ 
low’ also in Occitan, and of mouton ‘sheep’ also all over the north of Italy 
and in Catalan. French berceau ‘cradle’ still has cognates in Occitan, 
Catalan and Portuguese, but the Old Spanish forms are no longer used. 

The Iberian languages have some everyday words unknown elsewhere, 
which are assumed to be of pre-Roman origin. One is the word for ‘butter’: 
Span, manteca, Ptg. manteiga (which has been borrowed by Catalan, 
Occitan and Italian dialects). Another is the word for ‘left hand’ (izquierdo, 
esquerdo), which is clearly related to a Basque word and which is also 
found in Catalan and Occitan. Portuguese appears to keep more of these 
words than Spanish: Ptg. carvalho ‘oak-tree’ remains, whereas Span, carvajo 
has been ousted by roble < robur. 

If it is hard to assign substratal origins to words in the Western 
languages, it is even more hazardous in Rumanian, as we have little knowl¬ 
edge of Thracian and Dacian. Normally it is assumed that where 
Albanian and Rumanian share common vocabulary of unknown origin, it 
must have survived from the pre-Roman era. Quite a few common words 
in modem Rumanian fit into this category: 

(38) abur ‘steam’, brad ‘fir-tree’, brtu ‘belt’, bucura ‘to rejoice’, buzd ‘lip’, cioc ‘beak’, 

copac ‘tree’, copil ‘chiXA’,fdrimd ‘crumb’, ghimpe ‘thorn’, groapd ‘pit, ditch’, 
rajd’duck’, vatra‘fireplace’. 

Very often the words that we believe emanate from the substratum lan¬ 
guages are commonplace, but rather quaint, words to do with traditional 
rural life. These words add a certain exotic flavour to each of the lan¬ 
guages, but proportionally they are so insignificant that they do not consti¬ 
tute any sort of communication barrier to speakers of the modern 
languages. 

6.4 BREAKUP OF THE ROMAN EMPIRE - 
SUPERSTRATUM 

(a) Germanic influence on French 

The most widely accepted theory about superstratum influence provoking dif¬ 
ferentiation in Romance concerns the role of the Franks in northern France. 
Within the present European French-speaking area different Romance vari- 
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eties underlie the standard: the linguistic line between the North {oit) and the 
South (oc) is unusually sharp, though in the East (‘Francoproven 9 ar) the 
isoglosses fan out in a fairly unpredictable manner (see 5.1). The linguistic 
divide is paralleled by several different traditional cultural patterns - for 
instance, in house-building, in agricultural techniques and in legal practice. 
The north is seen as influenced profoundly by Germanic cultural features. 

The Oc(citan) dialects are much linguistically closer to their Romance 
neighbours, and form part of Romania continua (see 5.2), and indeed, as 
we have seen (page 3), it has even been suggested that they are nearest to a 
postulated unitary langue romane from which all the Romance languages 
were deemed to have sprung. The Northern dialects, on the other hand, 
show radical differences, which have even, in the past, led to doubts about 
their Romance character. The Northern linguistic area has, since the 
Middle Ages, encroached on the Southern, especially in the West, where 
the isoglossic boundary plunges south to encompass Saintonge in the 
Oil orbit. Elsewhere in the south Occitan varieties survive, if somewhat 
precariously, even though French has been dominant since the sixteenth 
century, and today there are no adult monolingual Occitan speakers. 

(i) Phonology 

Some of the distinctive characteristics of French date, as we shall see, from 
the later Middle Ages. But the first French texts (from the ninth or tenth 
century) indicate that, phonologically at least, French had already devel¬ 
oped in a different way from most other Romance languages. The specifi¬ 
cally French phonological transformations are usually linked to a 
postulated ‘heavy’ word stress that had the effect of reducing atonic and 
lengthening tonic syllables. Thus, for instance, all vowels in atonic final syl¬ 
lables except a were lost. The sound a appears to have been reduced in pre¬ 
literacy times to [a], usually spelt e. This weakened further in the course of 
Old French and was virtually lost by the modern period (though the e 
instable or ‘mute e' does re-appear in some contexts). 

In the more conservative areas, central and southern Italy, Rumania and 
Sardinia, word-final e is maintained in some form, but elsewhere it usually 
falls: 

(39) LEVARE ‘to lift’: Sard hare. Rum. luare, Ital. levare, Occ., R-R, Ptg. levar. Cat., 

Span, llevar, French lever. 

However, there is more general retention of the distinction between final o 
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and a, morphologically so important for gender marking (see 1.3), and 
also to mark the distinction between persons 1 and 3 in many common 
verb paradigms (see 1.2). In many Romance varieties [a] eventually weak¬ 
ens to [a]; [o] often closes to [u], and it is regularly lost in Catalan, Occitan 
and Rumanian in absolute final position. 

Old French words thus were frequently monosyllabic, ending in a con¬ 
sonant; word-final obstruents seem to have been devoiced as in modern 
German: 

(40) NOUE > neM/‘nine’, grande > grant ‘big’, trabe > /re/‘beam’, longu > lone 
‘long’. 

The most salient effect of the postulated strong accent was, however, felt 
in the tonic syllables. It was only in the Northern French varieties that the 
Proto-Romance vowels [e], [o] and [a] were subject to regular diphthon- 
gization (for the pan-Romance diphthongization of [e] and [o] see 4.2). In 
the centre of the area only vowels in free syllables were affected: some seek 
to relate the process to Open Syllable Lengthening in Germanic languages. 
In some more peripheral varieties, however, especially near the frontier 
with Germanic (here we include also some Northern Italian and Rhaeto- 
Romance varieties), diphthongization can take place in both blocked and 
free syllables. The relative chronology of French sound changes places this 
wave of diphthongization later than the pan-Romance wave, as words in 
which syncope of unstressed vowels occurred, thus blocking the relevant 
syllable, are not affected by the second wave: 

(41) DEBITU > dette ‘debt’ but TfipiDU > tiede ‘lukewarm’. 

The specifically French diphthongization process can plausibly be 
linked to lengthening of the tonic syllable, producing a falling diphthong: 
[ee] > [ej], [oo] > [ow], spelt in early texts ei, ou. The effect on [a] is less 
clear: a diphthongal graphy appears only before a nasal (a context which 
blocks diphthongization in other cases): 

(42) MANU > main, modern [me], amat > aime, modern [em] (but CANE > chien, 
modem [/je], with ‘Bartsch’s Law’ breaking under influence of the preceding 
palatal < *chen). 

Elsewhere the graphy e is found: 

(43) TALE > tel modem [tel], gratu > gre modem [gre] (where the modem differ¬ 
ence in vowel quality is recent and depends on syllabic structure), caru > Old 
chier (with breaking), modem cher [JeR]. 



248 


The differences 


It is usually assumed that a diphthong [aj] or [ae] had rapidly resolved to a 
simple vowel, by mutual assimilation. 

Unlike the ‘pan-Romance’ diphthongization process, this French one does not 
seem to affect Latin checked monosyllables: 

(44) lAM > ja ‘already’ (see rem > rien ‘thing, nothing’, MEL > miel ‘honey’). 

The Old French e < A does not rhyme or assonat with other e vowels in 
the twelfth century, but is coupled with the e of loanwords: it is usually 
assumed to be longer than the other e sounds. ‘Bartsch’s Law’ breaking of 
this e to ie, however, suggests a [e] quality, whereas [e] < E in similar con¬ 
texts seems to be raised to [i]: 

(45) CERA > cire ‘wax’. 

The ei and ou diphthongs soon became oi and eu in most dialects and 
contexts, by flip-flop differentiation of the nuclear vowel from the glide, 
though in the west the glide tended to be lost, perhaps indicating a short¬ 
ening of the stressed syllable. Differentiation, which is attested from the 
eleventh century, is associated, in traditional studies, with retention of syl¬ 
lable length and the heavy accent which was thought to characterize more 
northerly and easterly dialects. 

(ii) Morphosyntax: German influence on French word order 

It has been implausibly suggested that certain features of French syntax - 
notably the obligatory use of subject pronouns (see 1.2) and postverbal 
negation (see 7.4) - are due to influence from German. Chronologically 
this is unlikely. More acceptable is the suggestion that some aspects of Old 
French word-order owe much to Germanic influence. 

In Old French, in declarative main clauses, the verb normally appears as 
the second constituent (verb-second or V2), an ordering which is current 
in Germanic languages. Where the subject occupies the first position, there 
is little difference between Old and modem French, which prefers SVO 
ordering. When, however, another element of the clause is topicalized by 
placing it first in the clause, then the lexical subject is inverted - that is, 
placed after the verb - while a pronoun subject is usually left unexpressed: 

Longtempsfu ly roys Elinas en la montaigne ‘For a long time was King Elinas 
on the mountain’; Ileuses paroles distrent lifrere de Lancelot ‘Such words 
uttered Lancelot’s brothers’; Atant regarda contreval la mer ‘Then (he) looked 
down at the sea’. 


( 46 ) 
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Although in the other early Romance languages examples of V2 ordering 
can be found, it is more regular in Old French. To invoke the influence of 
Germanic seems reasonable, though there are doubts about whether the 
relevant Germanic dialects were themselves V2 at the time of most influ¬ 
ence (between the fifth and eighth centuries). 

In Surselvan V2 ordering is regular, but here it is likely that influence comes 
from close modem contact with German. 

By early modern French the V2 ordering became less frequent and stylisti¬ 
cally marked: the language arbiters decreed in favour of direct, logical (i.e. 
SVO) ordering. One question we could ask is whether indeed in Old 
French the V2 ordering was not also a literary artefact, and whether less 
formal usage, for which we have little evidence, may have preferred a less 
Germanic-type ordering. 

Another word-order feature of Old French which may owe something 
to Germanic influence is interrogative inversion in polar questions, where 
the verb often occurs as the first constituent in the sentence (VI) with a 
postposed subject: 

(47) Durrad lifiz Ysaia vus tuz chanps e vignesl ‘Will the son of Isaiah give to you 

all the fields and vines?’ 

In fact, use of a lexical subject in such cases was quite rare, but a post- 
posed pronoun was more frequently overtly expressed than in declarative 
V2 sentences. By the fifteenth century inversion of a lexical subject in 
polar questions was apparently archaic; for modern so-called complex 
inversion see 7.4. 

In the other Romance languages, VI ordering is not uncommon, so 
much so that some claim that the basic order is VI. Usually, however, this 
ordering implies not so much interrogation, as a degree of topicalization. 
We may assume that intonation has always been, as now, the principal 
means by which a polar question is distinguished in Romance from a state¬ 
ment. It is quite possible that French borrowed from Germanic the associ¬ 
ation of VI ordering with interrogative function. 
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(iii) Germanic vocabulary 

In the Western languages there is a not negligible proportion of common 
words of Germanic origin that have survived from the Roman and post- 
Roman periods. French, as one might expect, has the largest share, and 
Portuguese the least, with Italian high on the list of recipients. Some words 
were borrowed into Latin and spread through the Empire: the word for 
‘soap’ SAPONE is pan-Romance, and hints at the washing habits of Romans 
and Germans. Others are widespread but did not reach Rumania: the 
words for ‘war’, ‘to win’, ‘to guard, ward’, ‘to equip’ all speak of the mili¬ 
tary prowess of the Germanic invaders: 

(48) *werra: French guerre, Ital. guerra (probably borrowed into the other lan¬ 
guages); *waidanjan\ French gagner, Ptg. ganhar, Ital. guadagnare. Span. 
guadaiiar .; *wardon ‘to ward, guard’: French garder, Ital. guardare. Span., Ptg. 
guardar, *warnjan\ French garnir, Ital. guarnire. Span., Ptg. guarnir. 

In Other cases French is differentiated from the other languages by its 
choice of German word rather than Latin: 

(49) choisir ‘to choose’ (*kausjan) / Ital. scegliere < ’"ex-eligere, Span, escojer < 
*EX-coLLiGERE, Rum. alege < allegere; French sale ‘dirty’ (salo)! Ital. 
sudicio. Span, sucio < suciDU; saisir ‘to seize’ (*sakjan) / Ital. cogliere. Span. 
eager < colligere etc. (see French cueillir ‘to pick, to pluck’). 

However, the Germanic input into the French vocabulary remained mar¬ 
ginal, and mainly confined to more specialist terms, so that they do little to 
alter the overall configuration of the lexicon. 

It has been maintained that the relative paucity of Germanic loanwords 
in Rumanian is evidence that the language originated south of the 
Danube, and migrated north to present-day Rumania during the Middle 
Ages. However, others claim that there is a considerable number of such 
words that have survived the centuries from the earliest period: for most of 
these the etymological origin is contested. Nasture ‘button’ is one of the 
few that is generally recognized as Germanic, but it could have been bor¬ 
rowed via Italian nastro ‘ribbon’. 


(b) Arabic influence in Spain and Portugal 

The contact with Romance of the Moorish conquerors of most of the 
Iberian peninsula, from the eighth century until their final defeat in 1492, 
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was quite different from that of the Germanic invaders of the Romance 
Empire. For one thing, Arabic culture was far in advance of that of Dark 
Ages Western Europe, and the invaders were not so eager to adopt the lan¬ 
guage of the conquered people. That they did do so to some extent is 
shown by the evidence of the Romance refrains (harga’s) of muwassah 
poems, written in both Arable and Hebrew, of whieh we have evidenee 
from the eleventh century. However, some scholars cast doubt on the lin¬ 
guistic authenticity of some of these refrains: because of the difficulties of 
transcription from Arabic and Hebrew script, interpretation is often haz¬ 
ardous, and it is possible that they are little more than Romance-like ‘hey- 
diddle-de-dee’ nonsensical refrains. 

In the past, there has been a resistance on the part of Spanish and 
Portuguese writers to admitting any profound linguistic influence on the 
Northern dialects that spread south with the Christian Reconquest. Many 
of the characteristics that the layman may associate with Arabic, like 
Spanish velar fricative jota, are of comparatively recent origin. Very little 
in the modern languages, apart from borrowings, can reliably be attributed 
to Arabic: one exception is the suffix -/, as in alfonsi. 

More recently, politically correct attitudes prefer to envisage a symbiot¬ 
ic, rather than a conflictive, relationship between the medieval languages 
of the peninsula. It is quite probable, on historical grounds, that 
Portuguese was less long in contact with Arabic than Spanish varieties 
were. Certainly we know that the Spanish Jews, an influential and cultured 
group until their expulsion in 1492, were proficient in Spanish, and often 
acted as interpreters between Arabic and Spanish speakers. There is some 
doubt, however, whether the Muslim moriscos, mainly farmers, who were 
in their turn expelled in the early seventeenth century, did ever know much 
Spanish. 

Before the Reconquest, a conservative Romance language was used in 
Moorish Spain, by Christians, Jews and possibly by Muslims of Iberian 
stock (muwalladat). This variety is sometimes known as Old Andalusian, 
or more often as mozarabic, from the name (musta’ribun) given to 
Christians under Muslim rule, though there may have been very few of 
these left by the twelfth century. 

Certain features of this variety were conceivably influenced by Arabic, 
and it has been suggested these have survived into modern Andalusian 
Spanish. These concern mainly the pronunciation of sibilants and dental 
fricatives. In Castilian Is/ (s mainly < Latin s) is apico-alveolar, while /0/, 
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earlier [ts] (c, g < Latin+^’') is dental (and usually coronal): for devoicing of 
former voiced fricatives see 7.2. In Andalusia by the fifteenth century the 
two series seem to have begun to merge, with the spelling g (ceceo). It is 
assumed that Is/ had dental articulation in this area. In coastal areas today 
a sound similar to [0] is used for both series, whereas elsewhere, including 
Seville, a dental [s] (seseo) is current. This is the pronunciation which was 
purportedly exported to the Canaries and America. In much of Galicia 
and Portugal, though not in standard Portuguese, a distinction is main¬ 
tained between the two series. There is no good reason, however, to link 
the merger of the two series with Arabic influence, as northern France 
underwent the same change, probably by the thirteenth century. 

More reliable evidence of the influence of Arabic on the Iberian lan¬ 
guages is found among loanwords. Many Arabic words, as we have seen, 
entered the rest of the Romance languages via Spanish, usually accompa¬ 
nying some cultural innovation. There was some reaction in early modern 
standard Spanish against the use of such words, so that more have sur¬ 
vived in the Portuguese basic vocabulary than in the Spanish, though in 
both languages the proportion remains small; 

(50) Ptg. alfaiate ‘tailor’ / Spanish sastre, borrowed from Old French sartre < sar ¬ 
tor (but Old alfayate). 

Most of the common words that are different from those of other 
Romance languages are shared by both languages; 

(51) Span, aceite, Ptg. azeite ‘oil’; Span, aldea, Ptg. aldeia ‘village’; Span, almacen, 
Ptg. armazem ‘store’; Span, hasta (Old fata), Ptg. ate ‘until’. 


(c) Slavonic influence on Rumanian 

By the seventh century there was extensive Slavonic settlement in former 
Eastern Romance territory, and it is claimed that the newcomers north of 
the Danube adopted the local Latin, with a long period of bilingualism. It 
is certain that loanwords and cultural history provide evidence of close 
contact, but there is dispute about how far Slavonic influence accounts for 
the idiosyncratic features of Rumanian. 

I have already discussed, as a possible substratum phenomenon, the 
centralization of Rumanian vowels that some attribute rather to Slavonic 
influence. The frequent palatalization of Rumanian consonants may more 
plausibly thus be linked, especially in the case of the realization of the 
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masculine noun plural ending, and the person 2 verb ending, i, as palatal¬ 
ization of the preceding consonant. In many Rumanian dialects, too, there 
is Slavonic-like jodization of initial vowels. 

Some attribute to Slavonic pronunciation the ea result of Proto- 
Romance [e] when a (or sometimes e) appears in the final syllable: 

(52) neagrd < NfORA ‘black’, ceard < CERA ‘wax’. 

However, the parallel diphthongization of o is less likely to be due to 
Slavonic: 

(53) poartd < porta, coadd < CM}D\,floare < fl5re. 

Moreover, the diphthongization is found also in dialects south of the 
Danube, suggesting an early date for the change. It is possible that it is 
connected to the metaphonic changes of tonic vowels discussed earlier as a 
possible Proto-Romance development. In Rumanian, as in Portuguese, 
there may have been raising and closing of a metaphonic diphthong under 
the influence of a following high vowel, but lowering in other cases: 

(54) Ptg. porco [o] / porca [o]; Rum. pordpoarcd < porcuZ-a ‘boar’/’sow’. 

In Rumanian the lower vowel, perhaps lengthened under stress, seems then 
to have opened further, anticipating at the end the timbre of the following 
low vowel. The stem alternation frequently marks nominal gender distinc¬ 
tion and so is firmly rooted in the grammar. 

Other phonological features of Rumanian are clearly Slavonic in origin, 
in particular the sounds, like h ([h] or [x]), and sound sequences, like si, 
that are found mainly in Slavonic loanwords: 

(55) hrand'tood’, a se odinlti ‘to rest’, slab ‘poor’. 

Other features often linked to Slavonic are the abundant use of reflexive 
forms and the frequency of clitic doubling. These are, however, found in 
other Romance languages (see below), though Slavonic influence may have 
reinforced them in Rumanian. 

There is no doubt that there has been copious borrowing from Slavonic 
into Rumanian, though the vocabulary of any text still remains prepon¬ 
derantly Romance. We have already mentioned the influence of Slavonic 
on the composition of Rumanian numerals; similar loan-translation 
effects are found in, for instance: 


(56) 


lume < LUMEN ‘light’ and ‘world’.ya/d < facies ‘face’ and ‘person’. 
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Slavonic derivational suffixes like -ac, -nic, -u^ are particularly common. 
Examples of everyday words of Slavonic origin include: 

(57) boald ‘illness’, bumbac ‘cotton’, ceas ‘hour’, ciocan ‘hammer’, clopot ‘bell’, 
cohort ‘to go down’, co^ basket’, deal ‘hill’, drag ‘dear’, gata ‘ready’, gdsi ‘to 
find’, hotdrt‘to decide’, lovi ‘to hit’, mindru ‘proud’, nevmta‘wife’, omori‘to 
^i\\\prost ‘stupid’, raciot‘war’, ‘disgust’, trdi'to live’, veac ‘century’, 

‘tooth’. 

We may mention here common words that penetrated into Rumanian from 
Magyar: 

(58) gfnnf‘thought’.ye/‘manner’ 
and from Turkish: 

(59) chel ‘bald’, dayman ‘enemy’. 

6.5 SYNTACTIC DIFFERENTIATION IN THE 

EARLY MODERN PERIOD 

In chapter 3, we considered the question of the timing of the death of 
Latin, and I suggested that this was intimately bound up with the question 
of when the Romance languages differentiated. As long as the vernaculars 
were viewed as merely variants of the prestige language, Latin, then there 
was no clear perception of the separateness of the Romance languages. We 
have already seen that nonstandard regional varieties of Romance tend to 
merge into one another in a spatial continuum, with catenate mutual intel¬ 
ligibility. The standards, on the other hand, are clearly demarcated, and 
indeed often deliberately emphasize differences between themselves and 
their neighbours. Thus we may wish to date the definitive differentiation of 
the Romance language to the period of standardization, which in most 
cases coincides with the era of the Renaissance, which, by its own lights, 
was innovative and forward-looking. It can be claimed that there was at 
the time a new insight into history and the processes of change which 
arose primarily from the consciousness of language change over time, as 
distinct from that of spatial and social variation. The perception of the 
vernacular, on which intrastate communication depended, as labile and 
shifting was disconcerting, and so every effort was made to elaborate stan¬ 
dard, fixed codes. 

Many of the differences between the early Romance varieties that we 
have already discussed in terms of stratal influences were phonological, 
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and probably even, initially, merely phonetic. They may not have seriously 
hindered comprehension between Romance speakers, as superficial, natur¬ 
al and easily formulable variation. Vocabulary also varied from region to 
region, but often less frequent synonyms persisted in one area that made 
the usage of another area understandable. There was less variation in 
grammatical structures before the post-medieval period. 

We have ample evidence of medieval writers operating in different 
Romance varieties, as well as in Latin, and little inkling of any serious 
communication barriers. Even though feudal society confined much of the 
population to one place, there were always pedlars and strolling minstrels, 
who presumably had to be proficient in a number of local varieties, and 
who could spread features from one to another. We must not forget either 
that the Crusades meant that speakers of different Romance varieties min¬ 
gled together and, apparently, forged a common lingua franca of clearly 
Romance character. 

It is true that by the thirteenth century we become more and more con¬ 
vinced that many Romance speakers found learning Latin a hard task, and 
that recourse had to be made to the vernacular to ensure comprehension 
(especially in official documents). As we have seen, it is about then that dif¬ 
ferent language names began consistently to be applied to different regional 
varieties. That translations from the Latin into French were commissioned 
on royal authority in the fourteenth century is surely evidence of the grow¬ 
ing perception that these were substantively different languages, as well as 
of a deliberate attempt at status planning (normalization). Codification 
(corpus planning or normativization) did not really get going till the six¬ 
teenth century, when the new technology of printing was set to make the 
written word more accessible to a wider public. By the seventeenth century 
in Western Romance-speaking Europe, individual vernaculars, now linked 
to state power, were replacing Latin in nearly all its functions. But Latin still 
retained prestige as a cultural and religious language. 

I have suggested earlier that in some cases the very recourse to Latin and 
to other Romance languages for new vocabulary may have hastened 
phonological differentiation by introducing new phonological contrasts 
where previously there was merely allophonic variation. Against this may 
be placed the consideration that Latinized forms may have re-introduced, 
at the synchronic level, into speakers’ consciousness the morphological 
link between forms that were related etymologically, but whose phonologi¬ 
cal relationship had become obscured by sound changes. 



256 


The differences 


Another source of early differentiation in the texts may be an effect of 
the differing prestige of competing varieties within a Romance-speaking 
community. For instance, it can be argued that early Old French literary 
texts exhibit more Germanic-like traits than later ones. The very earliest 
texts appear in bilingual contexts or originated from bilingual areas (like 
England). It may be that the more Germanic syntax that is detected in 
these texts is not characteristic of Old French as an entity, but of a certain 
prestige variety. The changes that apparently took place during the Middle 
French period may reflect, then, not a change over time, but a switch in 
prestige between different varieties. Similarly, in Italy an early prestige 
Sicilian variety gave way to a more Tuscanized version represented in the 
great fourteenth-century writers, and the Galician variety used by 
medieval Spanish lyric poets was ousted by more politically prestigious 
Castilian. A parallel can be drawn with modern Rumanian, which opted 
more towards Romance models than towards Slavonic, in the process of 
standardization. 

Some of the syntactic changes that differentiated the Romance lan¬ 
guages in more modem periods will be discussed in the next section. Here 
we should emphasize that it is not always only standard varieties that par¬ 
ticipate in these changes, but also sometimes non-standard varieties. It is 
difficult in these cases to unravel the influences at work, but one must 
assume that nonstandard varieties tend in the modern era to be drawn into 
the orbit of their ‘umbrella’ or ‘roof’ language (see 5.5). This is particular¬ 
ly so in the last century, with the advent of universal education, which is 
almost exclusively conducted in the state standard language. Also to be 
taken into account is the koineization of regionally grouped dialects, in 
tune with assertion of separate identity: this is most evident in northern 
Italy and, to a lesser extent, in southern France. For more discussion of 
bilingualism in Romance-speaking countries see chapter 8. 

Many of the syntactic structures differentiating the modern Romance 
languages date back only to the early modern period, when a choice seems 
to have been made between earlier variants, with each language opting, 
presumably, for the most common, or the most prestigious, in local usage. 
In the next sections I shall concentrate on some aspects that have received 
particular attention from linguists because of their theoretical interest. 
These concern past participle agreement, clitics and determiners. 
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(a) The compound perfect and past participle agreement 

In the Introduction we considered auxiliary selection in the compound 
perfect, and in chapter 3 the genesis of this form. Here we shall consider in 
more detail the question of agreement of the past participle with the argu¬ 
ment of the verb in such forms. Everywhere when a B£:-verb is used, as in 
the passive and with unaccusative or pronominal verbs, the past participle 
can show gender and number agreement with the subject, just like any 
predicative adjective with a copula: 

(60) Ital. Maria e amatalandata; French Marie est aimeelallee; Span. Maria es 
querida (but ha ido) ‘Mary is loved / has gone’. 

Even though the standards decree that such agreement is obligatory, it is to be 
noted that spoken varieties not infrequently ignore the ruling, especially when 
the participle precedes the subject, or when they are at some distance one from 
the other. Non-agreement is attested in medieval texts, especially with coordi¬ 
nated subjects: 

(61) ... Oil les hystoires et lifait de touz les rois sont escrit ‘. where the histories 
and the doings of all the kings are written’. 

(Grandes Chroniques de France, c. 1270) 

In some North Italian dialects agreement occurs only when a subject clitic is 
used: 

(62) Cremona La roba e rival / La roba I’e rivada ‘The stuff has arrived’. 

In those languages in which a be auxiliary can still be selected for the com¬ 
pound perfect there can also be agreement of the participle with the object 
when a have auxiliary is selected. In those Catalan varieties that do not 
retain the auxiliary selection feature, there is still limited and optional past 
participial agreement. 

In all the languages for which we have medieval evidence, past participle 
agreement with the object of the verb is attested, although somewhat 
irregularly. This possibility ceased at about the same period (between the 
fifteenth and seventeenth centuries) as the possibility of selection of the be 
auxiliary in Spanish and Portuguese. We must assume that these two fea¬ 
tures are related and that the process of standardization led to the fixing of 
usage which had previously been variable. 

In Spanish it is tempting to see the features as bound up with the spe¬ 
cialization of haber as an auxiliary, and the comparatively tight cohesion 
between the finite and the participial forms in the Spanish compound past. 
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In standard Catalan, where there is similar cohesion, the retention of 
optional agreement may be connected with the concern of nineteenth- 
century language arbiters to differentiate their language from Castilian, 
and to embrace as many dialectal varieties as possible. In modern 
Portuguese, where ter has virtually replaced haver in its auxiliary function, 
the compound tense is little used and not strictly comparable with that of 
the other Romance languages (see 3.3). There is comparatively little cohe¬ 
sion between the auxiliary and the participle, however. The pronoun object 
can be intercalated, even though there is no agreement: 

(63) Tenho-as escrito ‘I have written them’. 

As already pointed out (page 20), a link has been made between retention 
of the adverbial clitic (< inde ‘thence’) and of the possibility of auxiliary 
selection. These can be seen as conservative features (see below). 

In the medieval languages past participial agreement is understood as a 
remnant of an original construction in which the have functioned as a full 
lexical verb and the participle as a passive (adjectival) form qualifying the 
object noun phrase (see 3.3): 

(64) Habeo litteras scriptas ‘I have letters written’. 

In some more conservative Romance varieties today such agreement is still 
possible with a postposed object: 

(65) Occ. (Western Languedoc) Ai venduda t ola vaca ‘I have sold the cow’; Balearic 
Cat. He vista la mare ‘I have seen the sea’. 

Agreement is normal (though not always found in colloquial use) in Spanish 
and Catalan with the verb tener ‘to have’, which is distinguished from the aux¬ 
iliary hater. 

(66) Tengo escritas las cartas; Tine escrites les cartes. 

In the medieval Romance languages the object noun phrase was frequent¬ 
ly, but not invariably, intercalated between the finite form and the auxil¬ 
iary. Agreement usually was marked, though not always. Attempts to 
formulate watertight rules on the basis of medieval attestations have not 
succeeded: one must assume that there was variation in usage, and that 
non-agreement tended to signify grammaticalization of the auxiliary + 
participle sequence, or, possibly, as in some modern uses, emphasis on the 
action rather than on the goal of the action. 

In modern standard Italian agreement is obligatory only with a third- 
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person pronoun object, which must be proclitic to the auxiliary, but it can 
be found in other contexts: 

(67) Le ho scritte / Ho scrltto le lettere (infrequently and very formally scritte) / Le 
lettere die ho scritto (rarely scritte). 

In dialectal usage agreement is not infrequent with a following object, but most 
often a pronoun copy is inserted before the verb: 

(68) Hai vendute le uova / Le hai vendute le uova ‘You have sold the eggs’. 

In Italian child language, we note, the participle tends to be acquired early with 
past reference and often agrees with its object: 

(69) Messa lapalla ‘put the ball’, i.e. ‘He/I put the ball’ or ‘the ball was put’. 

Agreement is also possible, in the standard, with the adverbial pronoun ne, 
but rarely found with first- and second-person pronoun objects: 

(70) Ne ho scritte due ‘I have written two of them (letters)’ / Mi ha reduto ‘He has 
seen me (female)’. 

Standard Italian also has agreement of the participle, in an absolute construc¬ 
tion, with a postposed object noun phrase: 

(71) Conosciuta Maria ... ‘Mary known ... ’ 

In modern standard French the schoolbook rule states that the participle 
agrees with any preposed object - not only the clitic but with a fronted lexi¬ 
cal noun phrase: 

(72) Je les ai ecrites; Les lettres que jai ecrites; II m 'a vue. 

However, grammarians are uncertain about agreement with the adverbial 
pronoun en, and usually advise against: 

(73) J'en ai ecrit(es) deux. 

The rules were first formulated in the sixteenth century, and refined in 
the seventeenth, after heated discussion. The principle followed was, on 
the whole, that only a preposed direct (accusative) object could trigger 
agreement. Much depends on the analysis of such pairs as: 

(74) les belles annees qu’il a vecues I les soixante-quinze annees qu’il a vecu ‘the 
happy years’ / ‘the seventy-five years he lived’ 

where the latter, as distinct from the former, is seen to involve a temporal 
expression rather than an argument of the verb. In pairs like: 
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(75) Elle s'est servi(e) delasoupe 

the form with agreement would be an inherent pronominal verb {se servir 
de ‘to use’) which, controversially, can be said to have an obligatory direct 
object se (absorbing accusative case), whereas, without agreement, the se is 
interpreted as an indirect (dative) object (‘She served soup to herself’). 
Even more contentious is the status of agreement in verb sequences. To 
trigger agreement the noun phrase has to be the object argument of the 
participle, rather than of another verb in the sequence. Thus: 

(76) la fable qu'il a eu(e) a reciter 

with agreement, is interpreted as derived from II a eu la fable a reciter ‘He 
had the fable to recite’, and without, from II a eu a reciter la fable ‘He had 
to recite the fable’. 

For most verbs modern spoken French has no overt mark of participial agree¬ 
ment, and violation of the schoolbook rule is so frequent that even the educa¬ 
tion authorities ‘tolerate’ non-use of agreement. 

In Standard Catalan agreement is found optionally with the clitic pronoun, 
including the ‘adverbial’ nelen'. 

(Jl) Jo n’he menjada una ‘I have eaten one of them’. 

Number agreement is less frequent than gender agreement, with other clitics. 

In some Rhaeto-Romance varieties (mainly those of the Engadine), in 
some Gascon Occitan varieties and in the Bastia dialect of Corsica agree¬ 
ment can also be triggered by a dative clitic object: 

(78) Lower Engadine Je tilla n'lia dit(ta) co chi sta ‘I told her how things are’. 

The version with agreement is interpreted as calling more attention to the 
addressee. 

If we view the present-day Romance usage from a historical standpoint, 
we can easily account for differences in terms of relative inertia and resis¬ 
tance to simplification: the more conservative varieties retain some relics 
of the adjectival status of the participle modifying the noun phrase which 
was the object of the HAVEyQvh. Even in the medieval languages agreement 
with a preceding noun phrase was more frequent than with a following 
one: in general, agreement is more regular from left to right than vice 
versa. What is interesting is that agreement with a preceding clitic is more 
persistent than with a full noun phrase. It is tempting to see this as due to 
pragmatic considerations, in that the antecedent of the object clitic is not 
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always easily recuperable without additional grammatical information. It 
may also be connected with a restructuration which turns the clitic into a 
morphological affix to the verb. 

At a synchronic level it is rather more difficult to account for all the 
vagaries of Romance past participle agreement within a single model. 
Most often links are made to agreement in the passive and unaccusative 
constructions: the object can be seen as underlyingly attached to the partici¬ 
ple, and raised first to the participle’s subject position and then further 
leftwards, leaving a trace in the preparticipial position, which can trigger 
off agreement in the participle, similar to subject-verb agreement. 


(b) CUtics 

(i) Adverbial clitics 

Some of the modern Romance languages use clitic forms for locative refer¬ 
ence. The most usual are derived from Latin ibi ‘there’ and inde ‘thence’, 
to give: 

(79) Ital. VI, ne, French y, en, Occ. i, entne. Cat. hi, enine, Sard hi, nde, R-R (Friuli) i, 

indl-nt. 

Similar forms were found in Old Spanish and Portuguese, but ceased, 
except in Aragonese, to be used by about the sixteenth century, when 
stressed locative adverbials were preferred. The [j] of Spanish hay ‘there 
is’, and perhaps of person 1 forms like soy ‘I am’, is a relic of the ibi clitic. 
Relics are found in other Romance varieties, but there is no evidence for 
the clitics in Rumanian (the preposition inde ‘between, towards’ probably 
derives from in+de). 

In the Italian of Tuscany these forms began to replace the persons 4 and 
5 object pronouns no and vo from the earliest period. In the standard, ci 
replaced neini with person 4 reference by the seventeenth century. It is 
thought that the ci form, which also has locative meaning almost inter¬ 
changeably with vi, is also originally an adverbial, perhaps for ecce-hic 
‘behold here’ or hince ‘hence’. The equivalent in Sard is nke, which lends 
credence to the latter etymology. 

From the earliest texts the adverbial clitics could refer not only to loca¬ 
tion but also to prepositional phrases that were only vaguely directional, 
typically with AD ‘at, to’ or de ‘of, from’. The earliest Romance occur- 
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rences of both ibi (jv) and inde {int) reflexes are attested in the (ninth cen¬ 
tury?) Strasburg Oaths; 

(80) .. .in nulla ajudha... nun It iv er ‘. in no help... to him in that respect I shall 
be’;... neuls cui eo returnar int pois ‘... no-one whom I can turn back from it’. 

The Romance reflexes of inde increased in importance with their reference 
to the partitive, which typically was compounded with de, and gained 
ground in some languages in the modern period (see below). It has been 
claimed that with unaccusative verbs in Italian ne can function as a subject 
pronoun: 

(81) Ne arrivano ‘Some arrived’; cf. Ne arrivano quattro ‘Four of them arrived’. 

Reflexes of ibi and inde used pronominally in the modern standard lan¬ 
guages tend to be confined to reference to inanimates. Nevertheless dative 
pronouns are frequently replaced by ibi forms, especially in nonstandard 
varieties, while inde forms may displace prescribed de + disjunctive pro¬ 
noun sequences. 

In some varieties (as in East French dialects) ibi forms can be used as neuter 
accusative pronouns. 


(ii) Animate and inanimate pronouns 

Language standardizers tend, in relation to pronominal reference, to 
emphasize the semantic distinction between animates and inanimates. The 
specialization of adverbial clitics in the latter role is an example of this. In 
standard French, for instance, a distinction can be made between: 

(82) J’y pense / Je pense d lui ‘I think of it / of him’; J'en attends la fin / J’attends sa fin 

‘I await its/his end’. 

There are, however, inconsistencies in usage. 

Another notable example of the animate/inanimate distinction is that of 
Spanish leismo, which can be linked to the ‘prepositional accusative’ (see 
3.3). Here an originally dative pronoun le is used for the accusative in ref¬ 
erence to a human referent, while the originally accusative forms lo, la are 
reserved for nonhuman referents. American Spanish varieties (with the 
exception of those of the Ecuador highlands and of Paraguay) rarely 
make use of this distinction, nor is it found much in southern Spain, but it 
is widespread in the north of Spain. There is evidence, too, of use of 
Portuguese Ihe in the same way, in Brazil as well as in Europe. In some 
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parts of northern Spain the dative form is extended to refer to all ani¬ 
mates, and sometimes to inanimate count nouns. In the Madrid area there 
is (nonstandard) extension also of the specifically feminine accusative 
form la to use as a dative in reference to human females. This tendency 
suggests that gender differences override case distinctions in this variety. 

Similar uses are attested in medieval texts, though because of the fre¬ 
quent reduction of all the singular third-person pronouns to / it is not easy 
accurately to judge how common it was. In the modern Spanish standard 
language, on the other hand, the forms lo, la and le are clearly differentiat¬ 
ed. Some early grammarians have attempted to prescribe modem use by 
reference to etymology and to traditional case distinctions, and have there¬ 
fore condemned leismo. However, in the modern standard, it is accepted as 
a feature of prestigious, polite usage (even though the Spanish Academy is 
still wary of use of the etymological dative plural les as an accusative). 

There are traces in the other Romance languages of a tendency to 
equate pronominal dative case with animate reference. This has been 
explained in terms of relative involvement in the activity denoted by the 
verb: an animate participant of an action is more likely to play the role of 
recipient than of patient, and so typically the dative form will more often 
refer to an animate than will the accusative. 

(iii) Clitic climbing 

Reduction, over time, in the frequency and range of past participle agree¬ 
ment in the compound tenses seems to have involved an increase in cohe¬ 
sion between the relevant verb forms. The opposite trend may be discerned 
in another type of two-verb construction, where the second verb is an 
infinitive. 

We saw, in 4.3, that there has been, in all languages except Rumanian, an 
increase over time in the use of infinitival complementation. In resultant 
finite verb + infinitive sequences, where the two verbs share the same sub¬ 
ject, a clitic pronoun object of the infinitive often attaches clitically to the 
(higher) finite verb: this process is called clitic climbing or promotion. It is 
assumed that this is an effect of the close cohesion (clause union or ‘mad¬ 
ness’) between the two verbs, which are treated syntactically as a single 
unit. 

Clitic climbing is found from the earliest period in the history of 
Romance, and it is, in most languages, still a more colloquial construction 
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for some sequences. With causatives (‘factitives’) and perception verbs 
(where the clitic can refer simultaneously to the object of the finite verb 
and the subject of the infinitive) climbing is the rule nearly everywhere: 

(83) Ital. L’ho fatto fare / Gliel’ho fatto fare / Lo vedo venire; French Je I’aifaitfaire I 
Je le lui ai fait faire I Je le vois venir; Span. Lo hice hacer / Se lo hice hacer I Lo 
veo venir; Ptg. Fi:-o fazer / Fiz-llo hacer I Vejo-o vir ‘I had it done’ / ‘I had him 
do it’ / ‘I see him come’. 

Although Rumanian does not use an infinitive in these constructions, the 
clitic still attaches to the higher verb: 

(84) Am facut-o / Am vdzut-o sd vind ‘I made/saw her come’. 

In the modern languages, the clitic can attach to the infinitive when its subject is 
difierent from that of the higher verb, so that a semantic distinction is possible: 

(85) Span. Hizo decirmelo ‘He had them tell me it’ / Me lo hizo decir ‘He made me 
tell it’. 

Sometimes too a clitic (especially the reflexive) can attach to the infinitive when 
there is a close semantic connection between the two: 

(86) French Nous I'avonsfait se laver. Span. Lo hicimos lavarse (but Ital. 

*L'abbiamo fatto lavarsi) ‘We made him wash (himself)’. 

Note that in older French texts a reflexive pronoun would not be used in this 
context. 

With modals, most languages prefer climbing, but permit the attachment 
of the object clitic to the infinitive: 

(87) Ital. Laposso vedere / Posso vederla; Span. La puedo ver I Puedo verla ‘I can see 
her’. 

In modern Portuguese non-climbing is preferred, but in Brazilian the clitic 
can attach proclitically to the infinitive, rather than enclitically to the finite 
verb: 

(88) (European) Pode-me dizer? I Pode dizer-me? (Brazilian) Pode me dizer? ‘Can 
you tell me?’ 

Rumanian, which permits an infinitive construction with putea ‘to be 
able’, always shows climbing in this construction: 

(89) Opot vedea ‘I can see her’. 

Climbing in French was usual until the seventeenth century, when it was 
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ruled out by grammarians, largely on the grounds that it is more logical to 
attach the pronoun to the verb which governs it. In the modern language 
the object pronoun is proclitic to the infinitive: 

(90) Je peux la voir / older Je la puis voir. 

In older French texts only a tonic pronoun could appear before the infinitive: 

(91) Le roy veult soy acquitter envers toy ‘The king wishes to acquit himself towards 
you’. 

Encliticization to the infinitive was rare. 

Climbing is found with other sequences, with much variation between lan¬ 
guages, and even between individuals. The differences have provoked much 
discussion, within various theoretical frameworks. The evidence points to 
a hierarehy of preference for climbing among verbs that take the infinitive, 
with modals and aspect auxiliaries high on the list, followed by conative 
(e.g. ‘try’), motion (e.g. ‘come’) and raising (e.g. ‘seem’) verbs. 

Given the variation in usage, it is not perhaps surprising that in non¬ 
standard varieties we sometimes find clitics doubled up and attached both 
to the finite verb and to the infinitive: 

(92) Occ. (Gascony) Qu'o me vaportar-o-me ‘He is going to bring it to me’; popular 
Ital. Tiposso assicurarti ‘I can assure you’. 

This can be seen as an effect of hypercorrection where climbing is older 
and more popular and non-climbing is more modern and prestigious. 

Note that in Spanish, leismo (see above) is found less frequently with climbing: 

(93) No puedo ir a ayudarle I No lo puedo ir a ayudar ‘I can’t go and help him’. 

Where there is more than one consecutive infinitive, which one attracts the 
clitic will be subject to some variation: 

(94) Span. Te lo quiero permitir hacer I Quiero permitirtelo hacer I Quiero permitirte 
hacerlo ‘I want to allow you to do it’. Galician Comenzo-o a aprender a fazer I 
Comenzo a o aprender a fazer I Comenzo a aprendelo a fazer I Comenzo a apren¬ 
der fazelo ‘I am beginning to learn how to do it’. 

Again we sometimes find repetition of the clitic on consecutive infinitives: 

(95) Occ. (Gascony) aquethpan qu'o vaupoder-o anar-o-teportar ‘that bread that I 
am going to be able to go and bring to you’. 

In the early history of Romance, clitic climbing seems to have been regu- 
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lar, not only with causatives and perception verbs, but also with modals, 
which are normally followed by a bare infinitive. One assumes that there 
was considerable cohesion between the sequential verbs, so that they filled 
a single verb slot. The climbing pattern seems sporadically to have spread 
by analogy to other finite verb + infinitive sequences, even where the infini¬ 
tive is introduced by a preposition (or complementizer). However, in the 
modern period there has been some reaction against climbing, with a ten¬ 
dency, prompted by logical and semantic considerations, to cliticize the 
object pronoun to the verb to which it most closely relates. This process is 
most advanced in French, where climbing with modals is no longer per¬ 
mitted in the standard, following the intervention of language arbiters. 

(iv) Clitic arrays (or clusters) 

One effect of the abundant use of clitic pronouns in Romance is that it is 
possible, theoretically at least, to have a whole sequence of clitics before or 
after the verb. In practice, it is quite rare to have an array of more than two 
clitics, and virtually unknown to have more than three, though seemingly 
superfluous clitics may appear, usually to involve participants in the dis¬ 
course (typically persons 1 and 2 datives of interest or ‘ethic datives’). 

Each modern standard has set rules on how the clitics should be ordered 
in an array. Most of these rules do not reflect medieval usage, which was 
more flexible (though it is not always certain from the textual evidence 
whether or not the pronouns were clitics and which pronouns received spe¬ 
cial emphasis). On the whole the rules favour orders that, in proclisis, place 
first-, second-person and then third-person reflexive clitics before other 
third person clitics, dative clitics before accusative, and adverbial clitics 
after all others. After the verb sometimes the same ordering is maintained 
and sometimes a mirror ordering is preferred. In some languages frequently 
used clitic groupings have special forms, often involving coalescence mor¬ 
phologically. This is especially so for dative-accusative sequences: 

(96) Rum. /-/ for ti it, Ital. glielo etc. for gli to. Span, se lo etc. for le to, Ptg. llo for Ik 

to, Sard Hu for li lu. Cat. //'/for li lo. (In these last two cases a sequence with the 
adverbial clitic, bilu and I'hi, is often preferred.) 

There is also a tendency to avoid the ambiguity that would arise from 
using clitics for both accusative and dative first- and second-person refer¬ 
ence, as most of the languages have no distinctive form for these cases (see 
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3.3). Even where there is a separate form, as in Rumanian, the accusative 
tends not to be used in arrays. Not infrequently, too, there is a tendency to 
omit third-person accusative reference from the clitic array. 

Beyond that there are curious differences of convention between the dif¬ 
ferent languages - though it should be noted that nonstandard use does 
not always conform to the set conventions. Discussion abounds on the 
ways to account for the differences, which sometimes are described in 
terms of surface filters that have no underlying validity, of the case mark¬ 
ing on the clitics, or the degree of activity or prominence of the partici¬ 
pants in the activity denoted by the verb. Among the ordering puzzles 
most discussed is the French third-person accusative-dative ordering {le 
lui, les leur compared with dative-accusative in me le etc.), the French 
ordering of adverbial clitics {y en), Spanish ‘spurious’ use of an apparent 
reflexive se (in se lo etc.) and the anomalous treatment of impersonal si in 
Italian. 

In the first case, the modern third-person ordering seems to be a leftover 
from the medieval regular ordering of accusative third-person pronouns 
first in the array, when there was encliticization to the first element in the 
clause: 

(97) Jet VOS di modern Je vous le dis ‘I say it to you’. 

This ordering was found before the thirteenth century, not only in French 
but also in Occitan, Catalan and some Northern and most Central Italian 
varieties. The change seems to be coeval with the change from enclisis to 
proclisis: in French the modern ordering is found from the fourteenth cen¬ 
tury but was not mandatory till the eighteenth, after much dispute among 
grammarians, first being codified in 1640. In Old French the sequences le 
li, le leur were rare, and the dative form li may have been interpreted as I’y, 
with the adverbial clitic playing the role of dative pronominal, as in Cat. 
I’hi, Occ. hi. From the fourteenth century the masculine tonic form lui 
began to replace li, ousting it completely by the end of the fifteenth century. 
With longer dative forms prefixation of the [1(9)] of the maseuline 
accusative, in particular, appears more natural. Modern non-standard 
usage often omits the accusative clitic, as was normal until the seventeenth 
century, but language arbiters insist on the overt mention of each argu¬ 
ment of the verb. 

In written Occitan the new {mi lo) order was adopted from the eigh¬ 
teenth century, probably under French influence, but the old order is still 
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found in some varieties, and in the Francoprovengal of the Swiss Valais. 
Standard Catalan has adopted the Spanish ordering, though some vari¬ 
eties retain the French. Tuscan abandoned its older ordering, usually 
enclitic (e.g. lo mi), by the fifteenth century (modern me lo, both proclitic 
and enclitic), owing, it is thought, to penetration from other dialects, 
where dative-accusative ordering was the norm. The change of vocalism 
in mi > me etc. may be influenced by the growing use of the adverbial ne, 
which has from the fifteenth century, now nonstandard, functioned as a 
third-person accusative clitic: me ne for mi ne was a common sequence 
from the thirteenth century. With third-person sequences like lo li there 
had been vowel assimilation from the thirteenth century (> liU)\ the form 
glide (for lele) is attested from the fourteenth century, with glielo found 
from the fifteenth century. 

As for the French ordering y en (which now virtually never occurs 
except in il y en a ‘there is some of it/them’), this was imposed by the mid¬ 
seventeenth-century grammarians, even though until the sixteenth century 
en i had been usual: 

(98) XXXen i ad = IIy en a trente ‘There are thirty of them’. 

Perhaps we can ascribe the change in order to the loss of syllabic status of 
y (ily a [ija]). In Italian the equivalent sequence is ci ne, and in Sard bi nde, 
but in Catalan the opposite ordering, n’hi, is preferred. 

The Spanish ‘spurious se' is usually seen as due to a sixteenth-century 
re-interpretation of Old Spanish dative-accusative sequences, e.g. gelo 
(with regular early development of [Ij] in illIillu > [eljelo] > [selo], and 
subsequent regular devoicing to [Jelo]). The [Je] form seems to have been 
identified with reflexive se. In the modern Iberian standard languages the 
se form, whatever its function, heads all clitic arrays: this has been 
explained in terms of its comparative neutrality in relation to case, anima¬ 
cy and deixis. 

In standard Italian so-called ‘impersonal sf (see below) always appears 
later in clitic arrays than reflexive si: 

(99) Lo si compra ‘One buys it for oneself’ / Se lo compra ‘He buys it for himself’; 
cf Ital. Non mi si asculta / Span. No se me escucha ‘One doesn’t listen to me’. 

When two si's are juxtaposed in the standard the adverbial or person 4 ci form 
is substituted for the first one: this is usually seen as due to a surface constraint 
on repetition of the same form. It was not found before the sixteenth century 
and was not ruled obligatory until the nineteenth century. Note the relation- 
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ship between the ‘impersonal’ and person 4 (ci) semantics and that ci is the 
normal reflexive form in some dialects. In the standard the impersonal si is not 
used when there is already a ci si sequence: 

(100) Uno ci si avvicina / Vi ci si avvicina ‘One approaches there’; see R-R (Friulian) 

Si consolasi ‘On consoles oneself’ 

where in the first sequence ci is the adverbial and si the inherent reflexive, and 
in the second, with adverbial vi, ci is reflexive and si impersonal. In the Friulian 
example the first si is interpreted as the ‘impersonal’. 

The rules for the use of impersonal si, which appears not to have devel¬ 
oped at the start of the modem period (see below), are complex and sub¬ 
ject to variation, especially when agreement phenomena are involved. 
Note especially variation in past participle and auxiliary agreement: 

(101) Ci si ^ comprati due penne / Ci si e comprate due penne I Ci si sono comprate due 
penne ‘One bought two pens for oneself’. 

A complicating factor is that for some speakers impersonal si appears to 
act functionally as a subject pronoun, and that in modern usage it is often 
equivalent to noi ‘we’ (see below). 

(v) Impersonal (indeterminate) se and on 

French differs from most Romance languages which make use of an ‘imper¬ 
sonal’, or rather ‘indeterminate human’, se constmction, preferring the use 
of on. This usage is traditionally associated with Germanic influence. 

(102) Ital. Si mangiava il cioccolato = On mangeait le chocolat, ‘One was eating choco¬ 
late’; Rum. S'agdsit bdiatul -On a trouve legarfon ‘One found the boy’; Span. 
Se quemo el dulce = Ona brule le dessert ‘One burnt the pudding’; Ptg. Vive-s bem 
aqui = On vit bien id ‘One lives well here’. 

There are some similar se uses in formal French style, but they are subject 
to strict constraints, especially as regards the definiteness of the object 
noun phrase, and the lexical status of the verb. 

(103) II sepenseplus de chases qu'il ne s’en dit ‘One thinks more than one says’; II se 
mange beaucoup de spaghetti en Italic ‘A lot of spaghetti is eaten in Italy’. 

In French the Romance SE passive (see 4.7) has never known quite the 
popular success that it enjoys in the other languages. The sixteenth- 
century growth in the use of this construction, usually ascribed to fashion- 
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able imitation of Italian and Occitan, was condemned by purists, who pre¬ 
ferred, for instance. On mange la soupe to La soupe se mange ‘Soup is 
eaten’. It was at about the same period that in other languages the imper¬ 
sonal construction developed out of the passive. 

The distinction usually made in Italian between the passive and the 
impersonal constructions concerns verb agreement. ‘Agreeing’ construc¬ 
tions: 

(104) Simangiano idolci! Idolcisimangiano ‘Sweets are eaten’ 

are labelled as ‘passive pronominals’, where the ‘derived subjeet’ (‘logical 
object’) triggers agreement. ‘Non-agreeing’ constructions (sometimes con¬ 
demned as nonstandard): 

(105) Si mangia i dolci ‘One eats sweets’ 

are viewed, on the other hand, as active impersonals, with si playing the 
role of an indeterminate agent, acting as an argument of the verb. These 
latter constructions seem to be acquired earlier by Italian-speaking chil¬ 
dren. 

A distinction is made by some analysts on the basis of word order, so that 
where the ‘logical object’ precedes the verb, in thematic position, it has become 
a ‘derived subject’ - as in passive constructions. Where the ‘logical object’ is in 
postverbal focus position, the si is regarded as the subject (or as co-indexed 
with an empty subject slot), and the ‘agreeing’ and the ‘non-agreeing’ construc¬ 
tions can be seen as stylistic variants. Alternatively, the ‘agreeing’ construction 
is regarded as ambiguous between ‘passive’ (with stylistic inversion) and 
‘impersonal’. 

One question that puzzles grammarians is how both ‘agreeing’ and 
‘non-agreeing’ constructions can coexist synchronically. What is their rela¬ 
tive status? How did they arise diachronically? It is in Spanish and 
Portuguese that there is most disagreement - even about the raw data. 
Some purists deny the existence of the non-agreeing constructions, or, at 
any rate, condemn them either as solecisms or as hyperurbanisms. Others 
see the agreeing constructions as anomalous (‘acceptable’ but ‘ungram¬ 
matical’) and the (admittedly less frequent) ‘non-agreeing’ constructions 
as basic. 

It is tempting to suggest, with some, that in some Romance varieties - 
Rumanian, for instance - the apparent ‘impersonal’ is merely a ‘slovenly’ 
version of the ‘passive’, with neglect of verb agreement. Non-agreement is 
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attested from the fourteenth century in Spanish and Portuguese, but 
apparent examples may result merely from a scribe’s omission of the tilde 
that would conventionally distinguish third person plural from the singu¬ 
lar form. It is really only in the sixteenth century that undisputed examples 
begin to abound. 

It can be suggested that the impersonal se construction results from a 
re-interpretation of an inverted {se VS) construction as non-inverted 
(SVO). The difference in semantics between an agentless passive and an 
indefinite agent construction is often so slight that one merges into the 
other. Whether indeed a new construction arose at the beginning of the 
modern era, or merely that the semantic range of the old construction 
widened, is a matter for debate. 

In Spanish and Italian, at least, however, the ‘impersonal’ se acts syn¬ 
tactically in different ways from ‘reflexive’, ‘passive’ or ‘middle’ se. How 
and when did the differences arise? 

In Spanish, the appearance of the object marker a before human 
patients (the ‘prepositional accusative’; see 3.3), in se constructions with 
human-agent transitive verbs, seems to mark a change in interpretation. 

(106) Se mataron los cristianos 

was ambiguous between a two-participant and a reflexive interpretation: 
‘The Christians killed themselves / each other’, or ‘The Christians were 
killed (by unspecified persons)’. 

(107) Se mold a los cristianos 

on the other hand, can have only the latter interpretation. The not infre¬ 
quent ‘agreeing form’: 

(108) Se mataron a los cristianos 

is often condemned as illogical and ungrammatical. 

Historically the more or less consistent use of the ‘prepositional accusative’ 
dates from the fifteenth century; from the sixteenth century it became part of 
standard usage (though even today there is regional and social variation). 

Spanish pronominal usage with intransitives, sometimes labelled as anom¬ 
alous or as a Gallicism, is attested from the sixteenth century: 

(109) Se vivefeliz ‘One lives happy’. 

It is found principally with verbs that can have a ‘cognate object’, and with 
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the BE verbs, ser and estar (see below). Although an adjectival comple¬ 
ment, as in: 

(110) Sees bueno ‘One is good’, Se vive tranquilo ‘One lives quietly’ 

appears in the masculine singular form in the standard, when reference is 
made to a human female (usually the speaker), the feminine singular is 
widely acceptable. Plural agreement is less acceptable, though not always 
rejected. 

In contrast, similar forms in standard Italian require a masculine plural 
agreement, as in: 

(111) Sie conlenti ‘One is content’. 

This seems to be connected with the interpretation of si as noi ‘we’, an 
allegedly Tuscan feature that gained widespread popularity in the nine¬ 
teenth century. The masculine plural agreement is seen as indicative of an 
impersonal usage, whereas, for female speakers, a feminine plural agree¬ 
ment would imply a first person plural subject. 

Unambiguous examples of ‘impersonal sf are hard to find before the 
fourteenth century (one difficulty is caused by the non-differentiation in 
the older graphy between si and si). The construction with unaccusative 
verbs: 

(112) Si parte ‘One departs’ 
postdates this period. 

In Spanish, Italian and Portuguese ‘impersonal se’ was a post-medieval 
development which grew up when the grammars of the standard lan¬ 
guages were being codified and when the vernacular was being used to 
treat of ‘serious’ topics, which lend themselves more to depersonalized, 
objective statements, focusing on the activity rather than the agent. The 
‘morphological’ passive, which in Romance has stative or resultative over¬ 
tones, was overtaken for this function by the more dynamic pronominal 
construction. 

French possibly developed earlier than the other languages a lexical 
means (on) of expressing the notion of ‘indeterminate agent’, though it is 
not the only language in which the pronoun is found. However, in Occitan 
and Catalan, at least, it is less popular than the ‘impersonal se’ construc¬ 
tion. 

Nearly all the languages in their earlier stages seem to have had access to a 
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HOMO, HOMINEM ‘man, human being’ form, cognate with French on, which 
could be used to designate an unspecified human participant in an event. In 
most Western standards this fell out of use at the beginning of the modem 
era. It has been suggested that one of the reasons for the demise of homo 
forms was that they were popular in flavour and associated with first-person 
semantics, both singular and plural. They tended to be replaced by the more 
formal, less personal and unambiguously singular, reflexes of unus. 

When the homo forms came to be interpreted as an indeterminate 
human (‘anyone’) rather than as an indefinite (‘man, a man, someone’) is 
hard to say. The widely accepted suggestion is that from the beginning in 
French on had indeterminate meaning. Usually the evidence of the 
Strasburg Oaths {om, used in parallel with German man) is thought con¬ 
clusive, but in that text om is separated from its verb, in a manner that sug¬ 
gests that it is not an unstressed pronoun, but a full lexical noun. The 
‘indeterminate’ meaning of on was well established, but not categorical, by 
the twelfth century; by the fourteenth century it flourished, but even then 
there was still a possibility of full lexical interpretation. 

One feature of French that favoured the differentiation of the indeterminate 
pronoun from the full lexical noun was the medieval retention of a two-case 
system (see 3.3), so that nominative homo > on (en, an) was formally distinct 
from HOMINEM > homme (omne, ome) - even though the case distinction 
between the two forms was not consistently adhered to. The use or non-use of 
the definite article with the two forms was not a diagnostic feature: even today 
I'on can be used as an elegant variant of on, and dialects often have a similar 
form as their only variant. 

Perhaps the question, of when on came to denote an indeterminate 
human, should be approached from a different direction, asking when on 
was grammaticalized as an indeterminate agent clitic. It is noticeable that 
in many of the earlier examples on appears postverbally; at the same peri¬ 
od subject pronouns were usually omitted in inverted declarative construc¬ 
tions. The regular spelling-out of on would seem to indicate that it was not 
unequivocally equated with subject pronouns. Later, when the subject pro¬ 
nouns were obligatorily proclitic to the verb in finite declarative clauses, on 
could be ranged among the other subject clitics. We know that Freneh had 
definitively made this change by the seventeenth century. 

Another pointer to the grammaticalization of on may be found in the pronom¬ 
inal forms with which it is co-indexed. Until the sixteenth century the third- 
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person pronouns il, le and lui could refer back to on, but in modern French soi 
is reserved for reference to on and a few other ‘indefinites’, while se is the object 
pronoun used for cross-reference within the clause (nous or vous sometimes 
being used over longer stretches of speech). This usage was standardized in the 
seventeenth century. 

It is therefore possible to see modern French on as the subject form of se, a 
non-referential clitic, of which the disjunctive equivalent is soi. In this per¬ 
spective, the difference between French and those languages which prefer 
the impersonal se construction in a similar function is indissolubly con¬ 
nected with its ‘non-pro-drop’ character (see 1.2). In French on ‘absorbs’ 
its se, except in true reflexive constructions, and with inherent pronomi- 
nals; in the other languages se refers back to an empty on slot. 

The French ‘impersonal’ constructions exemplified at (103) above may also be 
seen in the same light, with il, not on, as the overt subject. On in modern times 
has taken on first-person semantics, with more ‘involvement’ in the discourse 
situation. More formal objective styles have consequently turned to the imper¬ 
sonal construction with ilXo emphasize the indeterminateness of the agent. 
Colloquial styles, on the other hand, prefer to use the familiar second-person 
form tu for indeterminate reference. 


(c) Determiners 

(i) Partitives 

Modern French differs from the other Romance languages by its reluc¬ 
tance to use determinerless noun phrases. The so-called partitive articles 
du, des etc. replaced, by the seventeenth century, constructions without an 
article in examples like: 

(113) Donnez-moi (du) vin ‘Give me (some) wine’; J’ai vu (des) chevaliers ‘I saw 
(some) knights’. 

These determiners are clearly derived from de -l- definite article sequences 
{de le > del > du originally preconsonantal; de les > dels > des). They can 
be related to the pan-Romance prepositional partitive construction with a 
quantifier: 

(114) beaucoup des chevaliers ‘many of the knights’ 

and to what is sometimes known as the pseudo-partitive; 
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(115) beaucoupipas de chevaliers ‘many/no knights’ 

where the de may be regarded as a case marker. 

Constructions like these were quite frequent in French by the twelfth 
century, and partitive articles began to be used in the thirteenth century. A 
similar chronology seems to be valid for Italian, but here the standard lan¬ 
guage does not absolutely preseribe use of the partitive article. It is more 
characteristic of spoken use, and of northern regional varieties, than of 
formal language. 

The definite-article component of the partitive-artiele forms must have 
generic, rather than specific, meaning: the generic use of the definite article 
was not found in the early medieval texts, so it is not surprising that the 
partitive-use article developed quite late. The de eomponent of the parti¬ 
tive article can be analysed as a case marker, with a function similar to the 
partitive genitive of Latin. 

It is usual to describe the French singular partitive article, subject to cer¬ 
tain lexical and syntactic constraints, as determining non-count nouns. 
The plural partitive is usually seen, on the other hand, as the plural of the 
indefinite article, which is identical with the numeral ‘one’ (UNUS). 

Some languages, like Spanish, Catalan, Portuguese and Rumanian, make 
use of a plural form of the indefinite article, translatable as a quantifier 
‘some’. In Portuguese it is little used and conveys the meaning of ‘approxi¬ 
mately’; in medieval French and Italian similar forms referred mainly to 
collectives, like ‘a pair’, and died out at the start of the modern period. 
Normally, however, the Romance indefinite plural has no overt determiner. 
There are syntactic restrictions on the use of determinerless noun phrases, 
which tend not to occur in preverbal (topical) subject position. 

Examples of partitive determiners, similar to those of French, are 
attested for medieval Spanish, and there remain relics in some dialects, but 
the modern standard language gives preference to determinerless con¬ 
structions, with indefinite reference. 

I have already suggested a possible link between the partitive and the 
adverbial inde. In Catalan and Sard, where the adverbial survives, the par¬ 
titive noun phrase, as in early French and Italian, is marked by (preposi¬ 
tion or case marker) de without a determiner: 

(116) Sard Bimus abba I (De abba) nde bimus ‘We drink water’ / ‘(Of water) we drink 
some’, Cat. Tenia gossos / En tenia (de gossos) ‘He had dogs’ / ‘He had some (of 
dogs)’. 
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French determinerless noun phrases were virtually banned from the lan¬ 
guage at the time of standardization. Constructions like: 

(117) J’aifaim ‘I am hungry’ [‘I have hunger’] 

tend to be treated in the modern language as single units, rather than as 
verb + noun sequences. For instance, yawn in this sentence cannot pronomi- 
nalize: see 

(118) Spanish Tengo hambre I ILa lengo, Ital. Ho fame / ^.Ne ho, but French *Je I'ai, 
*J’en ai. 

Standard Italian is more tolerant than French, with the north tending to 
prefer the radical French solution, and the south the more conservative 
Catalan/Sard one. 

(ii) Possessive and demonstrative determiners 

Modern French, I repeat, eschews determinerless noun phrases: apart 
from the definite and indefinite articles developed by all the Romance lan¬ 
guages, French, in the early modern period, also introduced possessive and 
demonstrative determiners that are formally and functionally different 
from the related pronouns. 

We have already seen (3.3) that pronouns and determiners are historical¬ 
ly related: Latin did not have overt determiners but sometimes used, for 
emphasis, pronouns in adnominal position. It has been suggested by mod¬ 
ern theorists that the relevant pronouns are determiners (heading a DP) 
with an empty noun slot, or that they occupy the noun slot in a structure 
in which there is no determiner slot. For Latin the latter description seems 
appropriate, but as, over time, adnominal (appositional?) use of the pro¬ 
nouns increased there was reanalysis and the determiner slot came into 
being. In French this slot nearly always has to be overtly filled; in the other 
languages, as we have seen, non-topicalized, non-specific nouns can 
appear with a zero determiner. 

A formal distinction arose in Romance between stressed and unstressed 
pronouns, as a result of differential phonological treatment. In Old French 
the distinction was particularly salient, because of the radical effects of diph- 
thongization under stress and reduction of atonic syllables. In the case of 
possessive and demonstrative pronouns, the atonic (weak) forms Were often 
used adnominally; one assumes they carried less emphasis than their tonic 
counterparts. At the end of the Old French period, when there appears to 



When did the Romance languages differentiate? 


Ill 


have been a change in the stress patterning of French, the old phonological 
distinction became opaque, and there was a redistribution of forms, by 
which, by and large, the weak forms were specialized as determiners, while 
the strong (tonic) forms were the basis for the creation of modem pronouns. 
This distinction was not made so clearly in the other Romance languages. 


Possessives 

Let us look first at Romance possessives (see 2.2). In most modem lan¬ 
guages the Latin forms survive as adjectives requiring the addition of a 
determiner, while the modern pronominals have identical forms, usually 
with a definite article and a deleted lexical noun: 

(119) Ital. it mio (cavallo) / (more literary) it (cavallo) mio. Rum. calul meu, Sard su 
caddu meu. Span, it (caballo) mio (see also ‘neuter’ to mio), Ptg. o meu (cavalo). 
Cat. el (cavall) meu / el meu (email) ‘my horse’. 

In most languages the determiner with the adjectival form can be omitted 
in certain circumstances - for instance, when family relationships are 
referred to, or when the possessive is used predicatively. In some languages, 
particularly Italian, the possessive tends to be preposed to the noun, 
whereas adjectivals most often follow (see 3.4). 

In French the old use: 

(120) le mien cheval 

was condemned as archaic in the seventeenth century. The pronominal 
form can be used in the modern language only with the definite article, le 
mien, and adjectival use of the erstwhile strong forms is no longer allowed. 

In Rhaeto-Romance the strong forms are used as determiners without an article 

(121) Engadine mieu chavagl, Surselvan miu cavagl. 

It is thought they are modem substitutes for earlier weak forms. 

Weak forms are found in most of the early Romance languages, used 
sometimes enclitically (see 2.2) but mainly proclitically to the noun phrase. 
They have disappeared in most modem languages, but survive as deter¬ 
miners in French and Spanish (also in North Italian dialects): 

(122) mon cheval, mi caballo. 


The differentiation between forms with and without a determiner in the 
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older languages is not easy to understand. Sometimes weak forms were 
accompanied by an article, but usually it was the strong form. Use of the 
demonstrative was not infrequent, as in modern Spanish: 

(123) este caballo mio ‘this my horse’. 

It is possible to see the so-called article forms which accompany the pos¬ 
sessive as originally pronominal, used for emphasis. The change in the 
modern languages is that they have become determiners. In French and 
Spanish the weak possessives also became determiners, rather than adjec¬ 
tives, and so a functional distinction arose to match the formal distinction 
that had originally been phonologically motivated. 

Another way in which French, here accompanied by Rumanian, stan¬ 
dard Italian, Swiss Rhaeto-Romance, Occitan and Catalan, differs from 
other Romance languages is by developing, at a very early period, a person 
6 possessive from the genitive plural of the Latin distal pronoun illorum. 
Sard uses a similar form derived from ipsorum. In Rumanian lor takes the 
form of a case flection of the personal pronoun (see casa lor ‘[the] their 
house’ / caselor sale ‘of his houses’). In the other languages, except Sard 
and Italian, similar possessives are not accompanied by the artiele: 

(124) Ital. la loro casa, Sard sa domo issoro / R-R (Engadine) lur chesa, French leur 
maison, Cat. lltir casa ‘their house’. 

French alone inflects this possessive form for the plural to agree with the 
following noun. 

French leur and Italian loro are used as person 6 dative pronouns; although 
originally a stressed form, it is cliticized in French. In Italian, where it is char¬ 
acteristic of the modern standard rather than of colloquial speech, it is used 
disjunctively (see 4.4). 

The Other Romance languages use the original reflexive suus as posses¬ 
sives for both persons 3 and 6. As Romance third-person possessives do 
not distinguish the gender of the possessor, a preposition + strong pronoun 
sequence is often substituted, or added, where there may be ambiguity: 

(125) Ital. la di lei casa, Sard sa domo de issa; French sa maison a elle. Span, su casa 
de ella ‘her house’ (cf Rum. casa lui, casa ei, with the inflected pronoun, used 
frequently for sdu casa ‘his/her house’, also casa lui Jon ‘John’s house’ etc.). 
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Demonstratives 

We saw (3.3) that Romance definite articles derive from Latin demonstra¬ 
tive pronouns, usually the distal ille. Latin has three grades of demon¬ 
strative, which match the three persons of the verb. The Latin proximal 
Hic did not survive in its simple form anywhere. The second-person 
demonstrative, iste, took on the proximal role in most languages, usually 
reinforced by (AC) ecce/u ‘behold’: 

(126) Rum. dst. Span, este, Ptg. este / Rum. acest, cest, Ital. questo, Sard custu, Cat. 
aquest, R-R (Surselvan) quest. Old French icest, Occ. cest. 

The distal demonstrative was always an extended form of ille: 

(127) Rum. acel, cel, Ital. quello, Sard cuddu, R-R (Engadine) quel. Span, aquel, Ptg. 
aquele. Cat. aquel. Old French icel. 

A middle-distance demonstrative survives in some languages, though it is 
not as vigorous as the other two: 

(128) Ital. codesto < ecce ti istu; Span, ese, Ptg. esse < ipse; Sard cussu. Cat. aqueix 

< ECCE IPSE/U. 

In Spanish the demonstrative can be still be used as an adjective, as well as as a 
determiner: 

(129) el caballo ese ‘the horse over there’. 

Modern French differs from the other Romance languages in two impor¬ 
tant respects: 

1 There is no difference between the forms of proximal and distal demonstrative, 
so that the semantic distinction must be conveyed by the addition of a locative 
particle («‘here’ < ecce hic / Id ‘there’ < illac). 

2 There is a formal difference between determiners and pronouns, whereas the 
other languages use the same form for both: 

(130) determiners ce(t), cette, ces I pronouns celui, celle, ceux, celles. 

The determiners are seen to be derived from the old proximal demonstra¬ 
tive, while the old distal demonstrative has specialized as a pronoun. There 
are, however, some difficulties raised by the masculine singular preconso- 
nantal form ce which has neuter pronominal status in some contexts: e.g. 

(131) ce ‘that which’: 

and which seems to be a continuation of the old neuter pronoun fo 
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(< ECCE hoc). Moreover, synchronically it is possible to analyse the pro¬ 
noun as a determiner -l- strong third-person pronoun', ce+lui, ce+elle (5), 
ce+eux: making it a full noun phrase (or DP). 

What is relevant for our present discussion is that these changes in 
French took place at the end of the medieval period, when there seems to 
have been a radical upheaval of the morphological system and when 
closed sets of determiners, distinct from the related pronouns, were creat¬ 
ed. The other Romance languages do not as clearly distinguish determin¬ 
ers, adjectives and pronouns. 

6.6 CONCLUSION 

In trying to answer the question ‘When did the Romance languages differ¬ 
entiate?’ I have also obliquely addressed the more difficult question: Why? 

The examples I have examined in this chapter, and in previous chapters, 
of differences between the languages have pointed to early differentiation 
(or rather variation) in phonology and lexicon, but later splits in morphol¬ 
ogy and especially syntax. Many of the differences that modern theorists 
pay so much attention to arose in the modern period, and often only in the 
standard forms of the languages. Nonstandard varieties often preserve 
remnants of old uses that have been discarded by the prestige languages. 
The standards themselves have often been codified partly with the aim of 
differentiating clearly between the languages. It is to morphology and syn¬ 
tax - grammar - and to elaborated written style that the language codifiers 
paid most attention. Phonological differences, which loom so large in the 
work of traditional Romance philologists, were comparatively little consid¬ 
ered by those who sought to illustrify the vernaculars in an attempt to rival 
Latin. Thus it may be that the differentiation is as much a result of the dif¬ 
ferent representations that prestigious users had of their languages as of 
external influences or of internal processes. 

In chapter 4, I emphasized the forces for cohesion within the Romance 
linguistic world. Here I have been considering forces for dispersion. The 
two forces have always been in conflict, but at different times one would win 
out over the other. In the next chapter I shall consider the ‘How?’ question. 
I shall thus be concerned with the more general problem of how languages 
change, illustrating my discussion with examples from Romance, whereas 
in this chapter my concern has been more parochial, starting from differ¬ 
ences within Romance and considering when, and why, they came about. 
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How did the Romance languages 
differentiate? 


7.1 PROCESSES OF CHANGE 

Assuming that the Romance languages were once one language, differenti¬ 
ation may be compared with speciation in evolution. With isolation of one 
linguistic group from another, one would expect that they would each 
select and develop different variants in their parent language, or, in addi¬ 
tion, that each group would produce its own variant. As, over time, each 
group favoured internal interaction at the expense of communication with 
other groups, one would expect the differences between the groups to 
become accentuated. Thus each group would develop its own distinct lan¬ 
guage, which could be cherished as an emblem of its identity, and therefore 
be to some extent fossilized. 

This story raises problems for the Romance languages, as, except for 
Rumanian, they were never isolated from each other, commercially or cul¬ 
turally, and state frontiers have been mobile, even in the modern period. It 
also raises questions about the dimensions of the linguistic group that 
breaks away from its fellows. Are we talking of a family, a village, a region, 
or a country? It was only in the early modern period that the state came to 
be regarded as a linguistic domain; even today in more geographically 
inaccessible areas, as in the Alps, southern Italy or central Sardinia, each 
village retains its own linguistic identity, having to communicate with its 
neighbours in an alien language. Yet even here there has always been some 
intercommunication, involving either bilingual exchange or more often the 
use of a common interlanguage (now usually the official state language). 
In the Middle Ages it is assumed most Romance speakers were more or 
less captive territorially and that the Church diocese or the catchment area 
of a market town would form the domain of a linguistic grouping. This, 
however, was only partly so, and there were always pedlars, strolling play- 
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ers, mercenaries, itinerant friars who linked one community with another. 
Even though the everyday language of one region may have been barely 
intelligible to inhabitants of another, there was probably no consciousness 
of a gulf between the languages. Western travellers in Rumanian-speaking 
regions in the fourteenth century described the language as a variety of 
Italian, and indeed the differences between Rumanian and other Romance 
languages were probably not perceptibly greater than those between mod¬ 
ern ‘Italian dialects’. 

So our question should cover both real and perceived differentiation. 
We saw in chapter 5 that the perception of difference came to prominence 
from the early modem period, when language began to be associated with 
nationhood. Codification concentrated on written language, and tended to 
ban variation, so that orthography and syntax were strikingly differentiat¬ 
ed during the process of standardization. 

‘Natural’ processes must, however, have been giving rise to real differen¬ 
tiation long before that, as phonological and lexical variants were adopted 
and generated differentially by communities. We have already seen that in 
many cases the path followed by a Romance language is familiar from 
other language families. There is usually no need to make appeal to, for 
instance, contact phenomena as a necessary or sufficient cause of changes 
discussed in chapter 6: at best, stratal influences may have done little more 
than slant the course in one direction. 

Yet we should, I think, resist the temptation to write off all change as 
random, and therefore uninteresting. What is remarkable in Romance is, 
as we saw in Part I, just how much has not changed, how stable the lan¬ 
guages have remained over two millennia, so that they can still in some 
senses be considered as a single linguistic unit. It is a puzzle why a lan¬ 
guage should fall into stasis, comparative stability, in spite of the potent 
pressures for change that must be exerted through daily use by diverse 
individuals and communities, and by the hazards of the acquisition 
process. Universal grammar, constraints imposed on language in general, 
may account for the narrow restrictions in possible syntactic change. 
Phonetic factors, articulatory and acoustic, must place a limit on what 
phonological changes can succeed. But it is, I claimed in Part I, in 
Romance morphology and lexicon that we find most stability. 

Here is not the place to discuss in detail theories of change: I shall merely 
refer to Roger Lass’s inspiring forthcoming work on Historical linguistics 
and language change. In particular, I pick up the insightful image of Ian- 
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guage (or at least the ‘grammar’ without the semantics) as a dynamical sys¬ 
tem in which apparently swirling movement tends to gravitate towards 
‘attractors’ or ‘sinks’, and there to settle until a jolt is administered to the 
system. In language, grammaticalization and lexicalization may represent 
sinks of this sort. The morpheme or word allows economieal transmission 
of information and is readily acquired: in a social environment that favours 
smooth transmission of knowledge from one generation to another, lan¬ 
guage-specific conventions like morphology and lexicon may resist the 
attrition and distortion that inevitable phonetic variation must occasion. 

In Romance, accepted wisdom sees the first millennium as a period of 
apparently rapid change, followed by comparative stability during the 
Middle Ages, with again a transition leading to the apparent immobility of 
the modern standard languages. It is disputed how far the periods of sta¬ 
bility are illusive, merely ‘artifacts of textual conservatism’ (to quote Lass). 
The medieval texts on which we base our knowledge of language use at the 
time may be carefully crafted and may conceal, rather than reveal, linguis¬ 
tic reality. The modern standard languages, I repeat, have been subject to 
conscious manipulation and codification, deploring variation, rigidly dic¬ 
tating standards of correctness, and condemning socially unprestigious 
usage. 

It is often assumed that under the protective superstructure of a stable 
written language, underground forces were mining away at the foundations 
of real-life speech, which then would emerge (more or less) triumphant 
when there was a switch in prestige or of perception. It is also often 
claimed that the periods of apparent stability depended on the solidity of 
transmission of tradition, usually among the educated elite, but with the 
implicit support of a deferential majority. Social upheaval, war, famine, 
ideological or cultural revolution, and the like could, however, shake up 
the system and start it swirling around again. 

In chapter 8, we shall discuss some of the sociological factors that 
should be considered in a discussion of the Romance languages. We have 
already considered in chapter 6 the chronology of differentiation of the 
languages: in the rest of this chapter we shall look at a few phonological, 
morphological, syntactic, lexical and semantic differences. Fitting individ¬ 
ual developments into a classificatory Procrustean bed is not always prof¬ 
itable, as multiple causation and contradictory tendencies are in play in 
most cases. However, it may be illuminating to place some Romance events 
in the context of more general processes of linguistic change. These 
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processes are of course not peculiar to Romance, and examples may seem 
commonplace to historical linguists. What is of particular interest is how 
some processes have had a free run in some languages and have been 
blocked or reversed in others. This raises the question of measures of rela¬ 
tive conservatism (or reaction) and innovation within Romance. Taking 
for granted that language is a dynamic and not a static system, in what 
senses have some varieties been less open to change than others, and what 
linguistic, ideological or cultural factors could account for the differences? 


7.2 PHONOLOGICAL CHANGES 

It seems obvious that the level of language most subject to variation is that 
of phonetic realization. The Principle of Least Effort can operate most 
freely in allegro informal speech, where phonetic short-cuts are normal, 
and where psychological, social and communicational constraints operate 
least. Much linguistic change is therefore most usually seen as originating 
in phonetic variation: historical linguistics is concerned with how, over 
time, the variation takes a definite direction and triggers a change in the 
phonological system, and how the phonological changes interact with 
requirements of the meaningful side of language. 

Romance phonological changes are readily paralleled from other lan¬ 
guages. The significant thing here is the divergent directions the regional 
varieties have taken, and how far the differences draw the line between one 
language and another. Some of the features we shall examine involve dif¬ 
ferential evolution of those we have discussed in other contexts - like leni- 
tion and fortition or cluster and diphthong resolution. Within the confines 
of this survey, the minute details of every change cannot be listed. I have 
chosen only those that are salient, and which illustrate striking differences 
between the languages. Most frequently, the most variation and rapid 
change occurred in the earlier periods, while with standardization varia¬ 
tion was reduced in the prestige variety. 

(a) French accentuation and sandhi 

We have already mentioned the suggestion that a prime reason for the dif¬ 
ference between French and the other languages is its accentuation. In Old 
French there was allegedly a ‘strong’ word stress which clearly demarcated 
each lexical unit and favoured the loss of unaccented syllables, hence the 
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shortening of the word. In modern French, on the other hand, stress falls 
regularly only on the last syllable of the rhythmic group and words within 
the group run into each other. The change is assumed to have occurred in 
the Middle French period, probably between the fourteenth and the six¬ 
teenth centuries: some link it to the troubled times, with wars, the plague, 
economic depression, population decline, internal strife. A host of other 
related changes can be related to the accentual change, some of which we 
discuss elsewhere. These include levelling of diphthongs, truncation of 
word-final consonants and vowels, phonologization of nasal vowels, weak¬ 
ening of the noun case and of verbal person inflections, with consequent 
recourse to other strategies, the use of syntactic means to secure topical- 
ization (dislocation, clefting, etc.), pronoun cliticization, strengthening of 
forms of negation and interrogation. 

Some of these phenomena are, however, found elsewhere in Romance, 
where the word accent is kept: sandhi features, such as Spanish sinalefa, 
Italian rafforzamento sintattico, French liaison, elision and assimilation 
everywhere are found within the ‘cursus’, a syntagmatic sequence often 
identified with a syntactic constituent. It is to be noted, however, that nor¬ 
mally, except in Sardinia and southern Italy, sandhi phenomena affect only 
word-finals. 

In French the erosion of word-finals threatened, in the early modern 
period, pathological homonymy: one consequence has been that final con¬ 
sonants have not infrequently been restored in pronunciation. 

(1) chefUei\ / f/e/[kle];^ni> [finiB] /porter [poBte]. 

Usually the consonant restored is represented in the graphy, though there 
are non-standard examples of final epenthesis with no etymological justifi¬ 
cation: 

(2) [isit] for/n; [finit] for^m'e. 

Incidentally modem French also seems to show a preference for consonant- 

final English loanwords. 

Reaction against the reduction and elision of the only Latin atonic fi nal 
vowel to survive into French ([o]) must have begun earlier, as the feminine 
singular definite-article form in central French is, irregularly, [la]. In 
Picardy, where the masculine and feminine forms more regularly fall 
together as [la], an originally demonstrative determiner very often replaces 
the article, perhaps to preserve gender agreement (see 1.3). However, curi- 
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ously, the masculine possessives mon, ton, son replace, in modem French, 
the feminine forms ma, ta, sa in prevocalic position. Until the fourteenth 
century elided forms (w', t’, s’) were used. A similar ‘disagreement’ is 
found in Spanish, when the definite article is used before a word-initial 
accented a-: here the masculine form el is used, instead of a feminine 1(a) 
(for the distribution of article forms see below). 


(b) Levelling of diphthongs 

If diphthongization can involve feature splitting within a vocalic nucleus, 
then monophthongization, on the contrary, may result from the simulta¬ 
neous, rather than the consecutive, realization of features: this is often the 
process that is more traditionally known as levelling by mutual assimila¬ 
tion. More often one element of the diphthong disappears or, as a glide, 
becomes part of the onset or coda. For instance, from the putative Proto- 
Romance diphthong [je] in north-east France, [i] and [u] result from loss of 
the second mora, whereas in the Central French, Italian and Spanish out¬ 
come [je] the first mora approximates to become a glide. 

In the Central varieties (and hence the standard) the rich array of Old 
French diphthongs usually levelled by the process of mutual assimilation, 
allegedly as a consequence of change in the accentuation system. 
Confusion of the Old French diphthongs [ue] (< [uo] < tonic free Latin (3) 
and [eu] (< [ou] < [eu] < tonic free Latin o) is attested by graphy (written 
eu) and assonance in the twelfth century. In modern French the result is I 0 I 
(with [oe] as an allophone): intermediate stages [uo] and [ou] are usually 
assumed, and seem to have persisted in some varieties into the seventeenth 
century. Similarly Old French [ai], as in: 

(3) fait ‘deed’ < factu, mais ‘more, but’ < magis 

levelled to [e] (via [ei]?) by the twelfth century, to be followed in the same 
direction by the later, but similar, early modern French complex nucleus, 
as in: 

(4) plaie [pie] < [plaj] <[plaj3] ‘wound’ < plaga. 

In the latter case, however, the glide pronunciation was defended by 
sixteenth-century arbiters, and remnants survive: 


( 5 ) 


paietpaye [pe] / [pej] ‘he pays’ < pagat. 
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Inherited [an] < Latin au also levelled to [o:] very early in French; the same 
course was followed by the Old French diphthong resulting from vocaliza¬ 
tion of / (see above), though here in the sixteenth century there is evidence 
of diphthongal pronunciation in some varieties. In the modern standard 
[o:] has shortened and closed to [o]. 

One feature of French diphthong levelling is worth a closer look. This 
concerns the modem pronunciation of oi as [wa], as in 

(6) moi < mei ‘me’ < me; toil < teit ‘roof’ < rficru. 

Old French [ei] differentiated by backing and rounding of the first mora. 
Conventional wisdom has it that there was also lowering to and jodization 
to [oj], differentiated in twelfth-century verse texts from [oj], found, for 
instance, in: 

(7) noix [nojts] ‘nut’ < nCce. 

Certainly the two soon merged apparently as [oj]. In North-East French 
there was simplification to [o], but in Central French the second mora low¬ 
ered and opened to [e], while the first raised and closed to approximant [wj. 

Thus, as earlier [we] (<0) backed to [wo] in the twelfth century, its place 
was taken by a very similar complex nucleus, which in its turn seems to 
have levelled to [o] in some varieties. In the central area, however, it took 
two divergent directions: either it simplified to [e] (buttressed by the earlier 
western lowering and simplification of ei, [ei] > [ei] > [e], parallel to the ai 
levelling), or the second mora lowered to [a]. 

This latter process was probably initially conditioned by a following [r], 
as early as the thirteenth century (graphy od), but generalized to become a 
feature of Parisian popular speech by the sixteenth century. Italianate 
court pronunciation favoured the monophthongal variant, but seven¬ 
teenth-century grammarians strenuously sought to reimpose [we]. In the 
eighteenth century, owing largely to Voltaire’s intervention, the [e] variant, 
for instance in the imperfect endings, was written aislait.The [wa] variant 
was accepted into prestigious use at the Revolution, but continued to be 
written oi. Today there subsist lexicalized variants, including doublets like: 

(8) Fraitfais ‘French’ / Francois ‘Francis’. 

Shift of syllabicity from one mora of a diphthong to the other is not 
unusual. The history of French oi is of interest, not only as an example of 
social and regional variation and of conscious intervention but also 
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because it invites comparison with the development of some other Old 
French diphthongs. That of: 

(9) nuit ‘night’ < nOcte 

for instance, seems to have been: 

(10) [oj] (perhaps via [uoj] breaking) > [uj] > [yj] > [qi]. 

Presumably it was at a late stage that there was western simplification of 
[oi] to [i] and central absorption of the rounded approximant by the pre¬ 
ceding labial: 

(11) vide ‘empty’ < vuide < vOcitu 

In North-Eastern dialects simplification to [u] or [y], on the other hand, 
must have happened at an earlier stage [uj] or [yjj. 

It has been suggested that the development of French ie (< tonic free fi 
or < tonic free A, with Bartsch’s Law diphthongization under the influence 
of a preceding palatal) was parallel to that of ui. Initially stressed on the 
first mora, in the North-Eastern dialects it was simplified to [ij. In the cen¬ 
tre and west (beginning in the twelfth century) there would have been 
jodization > [jej. In the west this simplified to [e] and in the centre the jod 
was absorbed by a preceding palatal: 

(12) e/ier < eWer‘dear’< CARU. 

There was some tendency in the modem period, especially in versification, to 
resyllabification, treating the two elements of the diphthong as nuclei in hiatus. 
This was especially evident in Old French monosyllables: 

(13) Aier‘yesterday’< h£ri (see Ital. icr;. Span. ayer). 

In modem French there are hesitations about the syllabic status of some ie 
sequences, which do not continue Old French diphthongs: 

(14) Her ‘to bind’ [lije] /[Ije] < ligare. 

Note that there is doubt about the phonemic status in the modern standard of 
the [i]/[j] distinction which seems to be licensed by very few minimal pairs, like: 

(15) abbaye [abei] (older abbadie) ‘abbey’ < abbatia / abeille [abcj] (older [abeXa]) 
‘bee’ < APicuLA. 
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(c) Prosthetic and paragogic vowels 

Word-initial consonants and clusters are usually in Romance strengthened 
in comparison with their counterparts in medial or final position. It is not 
unusual, either, to find vocal onsets to geminate word-initials like [rr], A 
striking case concerns so-called ‘impure’ s followed by an obstruent, where 
a prosthetic vowel is attested from second century texts. In the early histo¬ 
ry of the languages of the Iberian peninsula and of France, as well as in 
Sard, a vowel regularly was inserted, forming the nucleus of a new syllable 
with s as its coda: 

(16) SCALA ‘ladder’: Sard (Logudorian) iskala, French echelle (Old eschele), Occ., 
Cat., Span, escala; sperare ‘to hope’: Sard isperare, French esperer, Occ., Cat., 
Span., Ptg. esperar. 

This type of prosthesis, due to the phonotactic constraint that forbids 
[#sC], remains productive in the Iberian languages, so that loanwords also 
acquire an initial vowel: 

(17) esnob < English snob. 

In French syllable-final s was regularly lost during the course of the 
Middle Ages, leaving the initial [e] stranded; however, in many words the s 
continued to be pronounced, allegedly through learned influence, and in 
later loanwords there was no prosthesis: 

(18) etable ‘cow-shed’ < STABULU / stable ‘stable’ (adjective) < stabile; estomac 
‘stomach’ (thirteenth century) / squelette ‘skeleton’ (sixteenth century). 

In Rumanian the initial s may be pronounced [J] in non-standard speech, 
and prosthesis is not common; indeed a ‘mobile ’s may itself be prefixed to 
a word-initial consonant: 

(19) (s)cobort‘to go down’ (Slavonic); (s)tufis ‘bush’(< tufa). 

In modem standard Italian, prosthesis is limited to postconsonantal contexts: 

(20) in iscuola ‘in school’, per istrada ‘by road’ 

and it is not popularly used outside Tuscany and Corsica. However, what 
may be in question here is apheresis of an earlier vowel parallel to: 

(21) Spagna < hispania; storia < historia. 

This is particularly common in Italian, Rhaeto-Romance and Rumanian, 
where the s continues an earlier prefix: 
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(22) excadere: Rum. scddea, R-R (Surselvan) scatter, Ital. scadere ‘to decline’. 

On the other hand, the striking predominance in Italian of vocalic word- 
finals and of open syllables may result from insertion, rather than from 
retention of Latin atonic finals: certainly in Sard paragogic vowels are fre¬ 
quent after word-final consonants, usually echoing an internal vowel: 

(23) batorlbatoro ‘four’ < quattuor (Ital. guattro); tempus/tempusu ‘time’< TEMPUS. 

It has been suggested, then, that Italian final vowels (other than a) are par¬ 
agogic, as in: 

(24) sale ‘salt’< sal; cuore ‘heart’< coR. 

The question, then, is how it is they so often reproduce the lost Latin vowel 
rather than a mere default [e] or [o]. Here Catalan provides a partial paral¬ 
lel. In plurals of nominals in which the final vowel is lost in the singular, 
earlier epenthetic [o] has been replaced, since the fifteenth century, by [u] 
(o) in masculine nouns: 

(25) brados) ‘arm(s)’ < bracchiu; mes(os) ‘month(s)’< mense. 

The same epenthesis is found in words of Germanic origin: 

(26) bosc(os),fresc(os) 

‘Incorrect’ restitution of apocopated final vowels is also found in North 
Italian dialects: 

(27) Genoa kamiiu ‘shirt’< camice, su(\)enu ‘young’ < juvene; fume ‘smoke'< 
FUMU, kwante < QUANTU. 

Re-introduction of the final vowel is particularly frequent in Lombardy, 
and indeed in all urban dialects, where presumably hypercorrection plays 
its part. It may be that, here, as with the French definite article, the differ¬ 
ence between masculine and feminine final vowels was reinforced compar¬ 
atively recently, partly aided by etymological spelling. 

In cases like these, differences between the modern languages may very 
well have been the consequence of a modem discarding of an earlier vari¬ 
ant in some languages, which is retained in others. 

(d) Consonantal clusters and epenthesis 

We saw, in diseussion of palatalization (3.3), that in Romance consonantal 
sequences tend to be reduced. This is particularly so when there is a syllab- 
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ic boundary between the consonants. On the whole Romance prefers free 
syllables, so that a syllable-final consonant is often weakened or lost; this 
has been illustrated in connection with French nasalization, and French 
and Spanish opening of syllable-final sibilants. But already in Latin there 
was a tendency towards regressive assimilation of the syllable-final conso¬ 
nant to the syllable-initial, resulting in a geminate: 

(28) DORSU > Dossu ‘back’ (Dalmatian dims, Ital. dosso, Sard (Logudorian) dossu, 
French, Occ., Cat. dos)\ mensa > mesa ‘table’ (Rum. masd, Dalmatian maisa. 
Span., Ptg. mesa, Old French moise). 

This tendency has continued most strongly in central and southern Italy. 

Syllable-final r is the most resistant of the consonants (but even this 
sporadically falls or > [1], sometimes being restored by learned influence): 

(29) French arhre, pronounced in the seventeenth century as [a:br], now frequently 
as [aab] (see Ital. albero. Span, arbol)- 

Syllable-final [1] (presumably with a velar tinge) frequently vocalized to 
[u] and then usually merged with the preceding vowel: 

(30) ALTERU ‘other’: Span, otro, Ptg. outro, French autre. 

The vocalization of syllable-final / is a feature of modern Portuguese 
varieties. Sometimes, however, the resultant [w] is fronted to [j], usually by 
dissimilation: 

(31) MULTU ‘much’ > Ptg. muito. Span, mucho. 

In some Sard varieties syllable-final [r] replaces etymological /, whereas in 
standard Italian and Rumanian / usually survives: 

(32) ALTU ‘high’ > Campidanian artu. Rum. inalt, Ital. alto. 

Syllable-initial clusters ending in / frequently palatalize (see 3.3), but 
obstruent + r clusters usually persist. In Portuguese, r is substituted for / in 
those clusters which escape palatalization: 

(33) branco,fror ‘white’, ‘flower’ / Span. b}anco,flor. 

Given the resistance to retention of consonantal sequences, it is perhaps 
surprising that syncope (see 3.2) should have operated to create even more 
unfamiliar sequences, often heterosyllabic. In French, with frequent loss of 
atonic vowels, this process has been reversed to some extent by epenthesis of 
so-called e instable [o], which breaks up consonantal sequences (in an opera- 
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tion usually known, inexactly, as the Loi des Trois Consonnes, which implies 
that three-consonant sequences are regularly affected by epenthesis). 
Whether the [o] is present in the underlying morpheme structure, or is mere¬ 
ly a surface lubricant, is a matter for dispute. In favour of the latter solution, 
the non-standard insertion of non-etymological [a] in sequences like: 

(34) ours blanc [UKSsbIa] ‘polar bear’ 

is usually cited. It is to be noted that in modern spoken usage one fre¬ 
quently hears non-etymological [a] supporting a final consonant: 

(35) Agnes! [ajiezo]. 

I have already suggested (see 3.2) that there may have been similar 
epenthesis in Italian sdrucciole (with antepenultimate stress). In morpho¬ 
logical forms where syncope has brought consonants into contact there is 
frequently regressive assimilation in Italian: 

(36) vorrd < VOLERE+AT, verrd < venir+at. 

In the other modern languages there is a tendency to consonantal epenthe¬ 
sis in similar conditions, creating a syllable-initial cluster: 

(37) Span, voldrd, vended, French voudra, viendra. 

Spanish seems to have less distaste for consonantal sequences than 

other languages, but frequently regularizes them in radical ways, showing 
particular preference for syllable-initial Cr clusters: 

(38) HOMINE > hombre ‘man’, simulare > sembrar ‘to seem’. 

Indeed epenthesis, or metathesis, of r after a syllable-initial obstruent is 
sporadically found with consonant sequences in all the Romance lan¬ 
guages: 

(39) French tresor. Cat. tresor. Old Span, tresoro (modem tesoro) ‘treasure’ < 
thesauru; French frontage / Itahformaggio ‘cheese’ < formaticu; Sard 
(Campidanian) braba ‘beard’ < barba; Span, milagro ‘miracle’ < miraculu. 

In the earlier periods there was no clear-cut geographical differentiation 
between the ways in which consonant sequences and clusters were handled 
in Romance. While gemination or vowel epenthesis could be seen as ‘con¬ 
servative’, palatalization, /-vocalization, nasalization or consonantal 
epenthesis were viewed as ‘innovatory’. Within each area one or other of 
these tendencies prevailed, though not uniformly over all levels of the Ian- 
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guage. In Spain and France the prestige languages were on the whole more 
innovating, whereas in Italy and Rumania it was the more conservative 
reaction that was usually preferred. 

(e) Uvular r 

One feature of French that does not fail to strike the most casual observer 
is the character of the r-sound in the standard. In some contexts it is a 
uvular vibrant, and in others a velar flap or fricative. The other standard 
Romance languages use an apical [r]: in Spanish the distinction between 
rolled initial and geminate [r] and intervocalic retroflex flap [r] is salient. 
Commentaries about the pronunciation of R in Latin leave no doubt but 
that it was a rolled apical sound. Non-Parisian varieties of French also fre¬ 
quently use such a sound, and it is still generally used by singers, though 
the uvular version has gained ground rapidly. It is known that before the 
Revolution the non-apical r (grasseye ‘greasy’) had been, for at least a cen¬ 
tury, seen as an unprestigious Parisian middle-class feature, but it was 
accepted by the national language during the Revolutionary period. 

As uvular r is found in other European languages, it has been suggested 
that there may be some genetic distribution that would correlate with its 
use. Other explanations for its spread have emphasized the fashionable 
prestige of French. 

One suggestion, advanced by Andre Martinet, is that the uvular pro¬ 
nunciation derived from that of the geminate (paralleled by word-initial 
use) generalized to all positions within the word. A parallel can be drawn 
with some Occitan and Portuguese varieties, where indeed a distinction is 
drawn between apical simple r and uvular geminate rr. 

(40) Ptg. cam [karu] ‘dear’ / carro (ns.) [k^] ‘cart’. 

In French we have ample evidence that in popular central usage in the 
sixteenth century simple r had weakened to a flap or dental fricative or 
even sibilant [z]: the doublet 

(41) chaiselchaire ‘chair (furniture/symbol of office)’ < cathedra 

is a remnant of this variation. Much scorn was poured by contemporary 
commentators on local pronunciation of Paris as [pazi], and it was proba¬ 
bly then that the Parisian bourgeoisie began to react by using the ‘strong’ 
equivalent in intervocalic as well as in initial position. It was only when 
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this same class itself acquired prestige during the Revolutionary period 
that the uvular pronunciation became smart, and began to oust more con¬ 
servative rustic variants. 


(f) Devoicing of Spanish sibilants 

Modern standard Spanish differs from the other Romance languages in 
having no voiced fricative phonemes. The voiced fricatives [B], [ 6 ] and [y] 
are allophones of the voiced stops, occurring mainly in intervocalic posi¬ 
tion. In some varieties a voiced affricate [(^] is found for [j] < standard [A], 
as in: 

(42) [cfeuCia] Iluvia ‘rain’ < pluvia 
but its phonemic status is uncertain. 

Medieval voiced sibilants [z], [ck] and [ 3 ] were phonemically distinct 
from their voiceless counterparts in intervocalic position, as shown by the 
following minimal pairs: 

(43) espesso ‘thick’ < spissu / espeso ‘expense’ < EXPENSU; deQir ‘to go down’, 
derived from DESCENSUS / dezir ‘to say’ < DrcERE;^x:o ‘fixed’ < Flxu / fijo 
‘son’ < fTliu. 

The voiced reflexes devoiced by the seventeenth century, and [J] took on 
velar articulation (at first identified with the h reflex of Latin f: see 6.3), 
while [ts] became interdental [ 0 ]. 

It has been suggested that the devoicing of sibilants was due to the influ¬ 
ence of Basque, which has no voiced sibilants, and that it was current 
among the bilingual population of the newly reconquered Castile area 
from an early period. It would, however, have remained underground, 
owing to the more conservative usage of prestigious varieties, until the 
shaping of the innovative standard in the sixteenth century. It has alterna¬ 
tively been claimed that the devoicing of intervocalic sibilants stems from 
a generalization of regular devoicing in word-final position. 

Another possibility is to view the feature [voice] as redundant in the 
modern Spanish consonant system: in the stop series the contrast is 
between [tense] and [lax], with the latter having context-determined voiced 
stop or voiced fricative allophones. In the fricative series, the medieval lax 
members were used infrequently, and nearly always in intervocalic posi¬ 
tion. In the radically innovating standard language, the old distinction 
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could be dispensed with. Sometimes forms that showed it were abandoned 
(as with espeso and degir above). Sometimes, other differential marks were 
developed: 

(44) fijo ‘fixed’ / hijo ‘son’. 

But quite frequently homonyms were kept: 

(45) hozes ‘sickles’/’gorges’ < falces/fauces; cojo ‘lame’/‘I collect’ < 
COXUs/cOlligo; oso ‘bear’/'I dare’ < Crsu/ausO. 

The fronting of [ts] to [0] and the backing of [J] to [%] is seen also to be con¬ 
nected with a conscious attempt in the standard to differentiate the sibi¬ 
lants, which in medieval Spanish were sometimes confused. 

7.3 MORPHOLOGICAL CHANGES 

I have maintained (chapters 1 and 3) that one of the diagnostic features for 
Romanceness is morphological similarity. Within the ‘sameness’, however, 
there is room for differences. Here we shall look at irregularities in verb 
morphology and how they differ between languages. 

(a) Morphological irregularity 

Irregular verb forms in Romance, as elsewhere, are a bugbear for the for¬ 
eign learner, especially as they differ between the languages. Many of the 
so-called irregularities have arisen from phonological developments in the 
early periods of development. In some cases they may appear to have little 
synchronic patterning and even require analysis in terms of suppletion, 
but in others they conform to a limited range of alternative templates. 
Some languages appear to tolerate a wider range of templates than others. 

It is usually assumed that morphological irregularities place a burden on 
the acquisition process, involving separate memorization of the irregular 
items, rather than the formulation of more general rules. That they are 
maintained, except in the case of very frequent items, is often attributed to 
acculturation processes, by which the child learns that some regularized 
forms are unacceptable in the community. One might expect therefore a 
non-standard language to tolerate fewer irregularities than a standard, 
which asserts greater social pressures. Alternatively it is not unlikely that in 
dying languages (like many Romance standards today) irregularities are 
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imperfectly learned by new speakers who have insufficient access to fluent, 
prescriptive language use, and so analogical levelling is frequent. Neither of 
these explanations covers all cases, and the degree of irregularity is different 
in different paradigms within the same language. In the present tenses, for 
instance, it has been shown, from a sample of standard and non-standard 
Romance varieties, that Catalan, Occitan (Languedoc) and Italian have the 
most allomorphic variation, and French, Francoprovengal (Vaud), North 
Italian (Piedmont), South Italian (Lucania) and Rumanian have least. 

Some languages have been more radical in regularizing by analogical 
remodelling; some retain more accidental irregularities without change; 
others have incorporated more verb forms into regular templates, or have 
spread the use of regularized inflections. Here we focus on differentiation 
and analogical relevelling in the roots of present indicative and subjunc¬ 
tive (which are pan-Romance), and on the simple past, or preterite (which 
has fallen out of spoken use in a number of languages). Examples will be 
cited mainly from the standard languages. 

(b) Stem alternation in the present tense 

Allomorphy in the Romance present tense stems historically principally 
from accent shifts within the paradigm, with differential vowel changes, or 
from different treatment of consonants depending on their position within 
the word. Virtually all the Romance languages contrast rhizotonic persons 
1,2, 3 and 6 forms with persons 4 and 5, which are stressed on the thematic 
vowel. In many cases today the quality of the vowels of the accented (e.g. 
p. 3) and unaccented (e.g. p. 4) stems differ in a fairly predictable way: 

(46) Rum. tace [a] / tdcem [a]; Ital. feme [e], temiamo [e]; French it live [e]/ nous lev- 
ons [a]; Cat. passa [a] /passem [a]; Ptg. corta [a] / cortamos [u]. 

Moreover, the stem-stressed forms in some languages still today reflect the 
effects of earlier diphthongization (see 3.2,6.4): 

(47) Rum. doarma! dormim; Ital. possiedelpossediamo, muovelmoviaino; French il 
acquiertlnous acquerons\ il meurtinous mourons; Span, cierraicerramos, 
cuentalcontamos. 

In French there was extensive remodelling of paradigm apophony of 
this type in the early modern period, so that, except in comparatively few 
verbs, there is uniformity of stem throughout the paradigm: the choice of 
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model for remodelling seems pretty random. In Spanish, on the other 
hand, though there has been levelling, there has also been some extension 
of the alternating pattern to verbs which were not originally affected by 
diphthongization: 

(48) piensalpensamos < pensare ‘to think’; cuelalcolamos < colare ‘to filter’. 

Spanish also has another template for vocalic alternation in -ir verbs, 
which involves the raising action of [j], with analogical spread to other 
parts of the paradigm: 

(49) pidetpedimos (see Ptg. stgolseguimos, durmoldormimos). 

Yet another Spanish template involves a three-way variation, elieli, oluelu, 
with the high monophthong appearing in the present subjunctive ending- 
stressed forms: 

(50) sientelsentimoslsintamos; duermoldormimosidurmamos. 

Even more striking are some of the effects of consonant changes on the 
verb paradigm. In French where consonants were lost in final position we 
have alternations like: 

(51) dorl [dOK] / dormons; veut [v 0 ] / voulons; boit [bwa] / buvons. 

Palatalization of velars by following front vowels (see 3.3) also makes for 
alternation, as in: 

(52) Ital. dico, dica [k] / did [J], Span, digo, diga / dices [0]; Ital. leggo, legga / leggi 

[dch]. 

Note that in the present subjunctive Latin e does not trigger palatalization: 

(53) Ital. paghi. Span, niegue 

One feature worth special mention is the ‘hardening’ of [j] to a velar: 

(54) venIo > Ital., Span, vengo. Cat. vine [bii)] / Old Ital. venho, Ptg. venho [ji]; Ital. 
salgo < salIo/ Old saglio, cf. voglio < ’"voleo. 

Here there may be analogical influence from forms with etymological g like: 

(55) Ital. colgolcogli [A] (colligere). 

However, in Catalan, for instance, we have collo [X] not *colc. 

In Catalan, Spanish and, to a lesser extent, Italian the patterning with a 
velar in person 1 of the present indicative and in the present subjunctive 
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paradigm has spread to other verbs where there is no etymological motiva¬ 
tion: 

(56) Ital. sciolgo (sciogliere, older solvere). Span, pongo (pOnO), traigo (older trayo) 
(TRAHO), Cat. caic (cadO), crec (credO), escric (scrTbO). 

In Spanish some verbs have been incorporated into the pattern, whereas 
others have left it: 

(57) Old oyo / modem oigo (audio); Old fiergo I modern hiero (ferio). 

The effect of analogical reformations, although increasing the pre¬ 
dictability of morphological patterning in each language, often reduces the 
similarity between languages. Before standardization of the languages, 
there was more variation in each of the languages and overlap between 
them, but in the modern period usually only one of the variants is regard¬ 
ed as correct, and different languages tend to opt for different patterns. 
French on the whole has preferred levelling of the paradigm with little dis¬ 
tinction made between persons, whereas Catalan makes greater use of 
stem alternation within the paradigm; Italian and Portuguese are compar¬ 
atively conservative (that is, they favour inherited forms), whereas Spanish 
is comparatively innovative, though not consistently so, in adopting newer 
templates. 


(c) Irregularity in the preterite forms 

The Romance preterite (simple past) forms are another example of the dif¬ 
ferential action of sound change and analogy in the different languages. 
Here though, unlike for the present tenses which are in current use in all 
the languages, there is also the added complication that in some languages 
the tense forms carry a distinct stylistic aura, which may help them to 
retain exotic morphology. 

The Romance preterite (passato remoto, passe historique, passe defini 
etc.) is derived from the Latin present perfect, which incorporates the 
Indo-European aorist. Its descendants have distinctly perfective meaning 
to some extent only in Portuguese: there is some evidence that this repre¬ 
sents a modern reaction, rather than simply a survival of the Latin use. 
Elsewhere it is replaced in perfective function by the compound perfect 
(see 3.3). Where it survives it is used mainly as a punctual, or as an incep¬ 
tive, past tense. It has fallen out of use in north Italy (including Rhaeto- 
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Romance) and Sardinia, and is not used in spoken Rumanian, Catalan or 
French, though still used in formal written style. In more colloquial 
Catalan it was replaced from about the seventeenth century by a 
periphrastic form composed of the verb ‘to go’ and the infinitive, which in 
the medieval period had ambiguous time referenee (see 4.6). In the other 
languages, including the Catalan of Alghero in Sardinia, it has been 
replaced by the compound perfect, which has extended its use to refer also 
to past punctual events. 

There were many irregularities in the Latin perfect stem, most of which 
were inherited from Indo-European. The personal inflections, on the other 
hand, are predictable: 

(58) -I, -ISTt, -IT, -IMUS, -ISTIS, -BRUNT. 

In Italian these endings develop fairly regularly, though there is some con¬ 
traction in the regular stems, and a paragogic [o] is added to the person 6 
ending: 

(59) -AVIT > [aut] > [o]; -avimus > -ammo: -Averunt > -arono. 

In Spanish and Portuguese the putative person 3 [aut] ending is extended 
to regular verbs in the other conjugations (which are merged in this tense) 
and in Spanish also to irregular verbs: 

(60) Span, dormid, Ptg. donniu ‘he slept’ < dormivit. Span, dijo / Ptg. disse ‘he 
said’ < DlxiT. 

Italian also has a few verbs with theme vowel [e] which have optional end¬ 
ings that are apparently derived from the perfectum of stare ‘to stand’ 
(stEtI): 

(61) vendetti /vendei < vendIdI ‘I sold’. 

Rumanian and Catalan have extended the [r] of the Latin person 6 to all 
plural inflections. In Catalan there is extension of the velar element used in 
the present subjunetive (see above) to many preterite stems: 

(62) rigui, volgui (pres. subj. p. 1) / rigid, vulgui (pret. p. 1) {riure ‘to laugh’ < 

RtDERE), voter ‘to want’ < *volere, for velle. 

Latin common ‘strong’ verb perfect stems tend to survive, enhancing the 
degree of synchronic irregularity in the modern Romance preterite. In 
Spanish and French metaphonic raising of the stem vowel by the person 1 
ending I is extended through the paradigm: 
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(63) Span, hice, hizo, tuve, tuvo; French fis,fit (older fist), tins, tint < fecI, FECIT, 
tenOi, TENOlT/Ptg.^z,/&, tive, teve. 

In Spanish high stem vowels [i] and [u] are sometimes seen as almost mark¬ 
ers of the preterite: 

(64) hube < habOi; estuve (Latin SYiJi),puse < POsOi, vine < VENt, di < d6dI ‘had’, 
‘was’, ‘put’, ‘came’, ‘gave’. 

The greatest degree of irregularity among the reflexes of the Latin 
strong perfects is found in Italian, where there is stem alternation in the 
paradigm, between stem-stressed and ending-stressed forms: 

(65) conobbi, conoscesti ‘I, you knew’ (< coGNOvr, -ISTI), scrissi, scrivesti ‘I, you 
write’ (scrips!, -iSTt), misi, mettisti ‘I, you put (mIsI, -istI), feci,facesti ‘I, you 
did’ (Fgcl, -iSTi); ebbi, avesti ‘I, you had’ (habOi, -istI). 

Whereas Portuguese alternation can be called conservative, the Italian 
alternation is an innovation and is sometimes explained as analogical to 
the alternation induced by regular phonological change in, for instance: 

(66) tenni, tenesti (tenOI, -istI) 

where the ending-stressed form of the stem is identical with the present 
stem. Others claim that the present-stem resemblance is merely accidental, 
and that it is the stem-stressed preterite forms that have most resisted the 
levelling which had already affected the Italian imperfect subjunctive: 

(67) facessi (fecissem) / Span, hiciese, Ptg.fizesse. 

Only a few strong preterite forms do not fit into the usual alternating pat¬ 
tern: 

(68) fui,fosti ‘I was, you were’ (fOI, -istI), diedUdetti, desti ‘I, you gave’ (dedI, -istI), 
stetti, stesti ‘I, you stood’ (stetI, -ISTI). 

These happen to be the verbs that do not use the present stem in the imper¬ 
fect subjunctive: 

(69) fossi, dessi, stessi. 

Here, even more than in the present, then, the interaction of sound 
change and analogy has increased the distance between forms in the mod¬ 
ern languages. 
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7.4 SYNTACTIC CHANGES 

Syntactic structures are fairly uniform across the Romance languages. It is 
usually possible to isolate the small differences that exist, and relate them 
to morphological, or eventually phonological, differences. Here we shall 
examine some deviations in the syntax of negation, relativization and 
interrogation that single out French as different from mainstream 
Romance. 

(a) Negation 

In Latin traditional grammar, sentential negation is seen as effected pri¬ 
marily by the ‘negative adverb’ non. This survives as a negator in all the 
Romance languages; where it is used in sentence negation, it normally 
occurs, as in Latin, in preverbal position. A n(e) prefix also serves to 
negate other elements, such as (n)umquam ‘(n)ever’, (n)ullus 
‘none/some’. The former of these survives in Spanish and Portuguese 
(nunca) and perhaps in Arumanian {ningd, ni(n)cd) and the latter, with 
rather literary flavour, in a number of languages. In Walloon nuk is used 
for ‘nobody’ (as distinct from [reg]); the neuter plural form nulla is used 
for ‘nothing’ in Dalmatian, Sard and Friulian Rhaeto-Romance: 

(70) Veglia Jal non kapua nolla at majo diskurs ‘They didn’t understand anything I 
said’; Sard Es tantu tontu ki non inde gela nudda ‘He is so stupid he doesn’t 
understand anything about it’: Friuli Nol ves sintut nuje di nuje ‘He had heard 
nothing at air. 

Nihil ‘nothing’ probably survives nowhere. Nemo ‘nobody’, which was 
already defective in Latin, has modern reflexes only in Sard (nemmos) and 
Rumanian (nimeni < nemine). More usually, new morphologically nega¬ 
tive pronouns are formed by prefixing a negative element: 

(71) ‘nobody’: Ital. nessuno < ne-ipse-unu; Rum. dial. (Arumanian) nitiun. Span. 
ningmo. Ptg. nenhum. Old French niun, Occ. degun < nec-unu; ‘nothing’: 

Ital. nienle, French neani (now used only as a lexical noun) < nec-ente or nec- 
gente; Rum. nimic < ne-mica. 

What is striking, though, is the number of Romance negative items that 
are derived from etymologically positive ones. This is particularly so in 
French, which has lost most of the inherited negative items. Originally 
polarity items associated with non in sentence contexts, they have often 
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taken on negative semanticism when used alone. Today they can still be 
used with non, but the effect is of negative concord, rather than of double 
negation. These items often derive from nominals and include: 

(72) Span., Ptg. nada ‘nothing’ (originally from (res) nata, ‘thing bom’, cf. also 
Span, nadie ‘nobody’, earlier nadi)\ French personne ‘nobody’ / unepersonne ‘a 
person’, rien ‘nothing’ < REM ‘thing’, pas, pomt ‘not’t un pas, unpoint ‘a step, a 
point’; Ital. mica ‘not at all’ / una mica ‘a crumb’; Cat. gens, cap ‘none’ < gens 
‘people’, CAPUT ‘head’. 

Negative adverbs like Italian mai, French (ja)mais ‘never’ (< magis ‘more’) 
developed in the same way, but we shall consider only the development of 
the nominals. 

Acquisition of negative meaning by such items results historically from 
their very frequent use in postverbal position in the scope of a preverbal 
negative non. The proeess is of course well known from other languages, 
like Germanic. Nominals originally designating some insignificant item (as 
in ‘not a jot’) were frequently used to reinforce the negative in the medieval 
languages. Among those that survive as negative elements in Romance are: 

(73) GUTTA ‘drop’, BECCU ‘beak’, cornu ‘horn’. 

Optional use of postverbal reinforcers is known to all the Romance lan¬ 
guages and was particularly common in the Middle Ages. 

When the reinforcing element is fronted in modern standard Italian, Spanish 
and Portuguese it ‘absorbs’ the non reflex (see 3.4). 

By early modern French the use of such reinforcers had become virtually 
obligatory, and ne < non was cliticized to the finite verb. Some languages, 
notably non-standard French, Occitan and some Rhaeto-Romance and 
Northern Italian varieties, have gone further and lost the non reflex in pre¬ 
verbal position, leaving the old ‘reinforcer’ (called in French the forclusif 
marking the scope of negation) as the only negator; 

(74) R-R (Surselvan) Capesch buc (bucca) ‘I don’t understand’; North Ital. dial. 
(Turin) Lu saj neng (nec-ente) ‘I don’t know it’; Occ. (Gevaudan) Be pa 
(passu) ‘He isn’t coming’. 

Usually this is connected with phonetic weakening and cliticization of 
non, which tended to favour obligatory use of the reinforcer, which in its 
turn led to the neglect of the weakened preverbal negator, which became 
redundant. There is direct evidence of the loss of the preverbal negative 
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element in southern France and Switzerland from the sixteenth century, 
and it may be that in colloquial French seventeenth-century use, the ne was 
already regularly dropped in negative sentences, though the standard con¬ 
tinues to dictate its use. In northern Italy loss of the preverbal negative 
may have occurred more recently; in some dialects reliance on the postver¬ 
bal negator is more complete than in others, and is subject to a variety of 
constraints (for instance, the subjunctive sometimes requires ‘weaker’ 
negation, without a reinforcer, than the indicative). 

A not dissimilar process is found in some modem Portuguese varieties, where 
negation is marked by nao at the end of the clause, representing a simplifica¬ 
tion from the widespread emphatic echoing of preverbal nao. 

It has also been suggested that the shift in placing of the negator to 
postverbal from preverbal position foreshadows a typological change from 
OV to VO ordering, and that non-standard French, in particular, has 
taken this process furthest. In Surselvan Rhaeto-Romance, a V2 language, 
the influence of German is plausibly suggested. 

Current conventional wisdom among generativists is that negation can 
be represented as a (universal?) non-lexical functional projection, which is 
ordered hierarchically above the T(ense) projection. The head of this pro¬ 
jection would be the negator (non), but in specifier position there would be 
an ‘operator’ (the forclusif e.g. pas). When the (finite) verb raises to receive 
(or to ‘check’) tense and agreement features, the head negator raises with it 
(and may cliticize to it), whereas the operator remains in postverbal posi¬ 
tion. The difference between, say, Italian and French negation could there¬ 
fore be resolved simply by postulating that in Italian the non element can 
negate on its own, and the operator can be non-overt (i.e. absent in the sur¬ 
face representation), whereas in French the ne needs the overt presence of 
an operator (e.g. pas) to fulfil its negative function. In some languages (e.g. 
non-standard French) it is the negative head (ne) that is nonovert, so that 
the whole burden of negation falls on the postverbal operator. 

This hypothesis does not adequately account for the similarity in distri¬ 
bution between the forclusif elements and adverbials, nor for the intuition 
that it is these elements that are the true negators in some languages. Note 
that in the French creoles the pas reflexes serve as preverbal negators (see 
2.2), whereas in standard French a sole ne element seems little more than a 
decorative expletive, in complement clauses dependent on some verbs or 
introduced by a conjunction with ‘negative semantics’ (see 3.4). 



How did the Romance languages differentiate? 


305 


It also leaves us with the old-fashioned problems of explaining how the 
‘head’ negator lost overt expression in some languages, and why others 
prefer (as was normal in Latin) non-overt expression of the operator. Loss 
of the NON reflexes is not always preceded by phonological weakening 
and/or cliticization, but it appears to be correlated in many varieties with 
subject cliticization and with use of the ‘adverbial’ clitic inde (see 6.5), 
which often takes on a form easily confused with the non reflexes. 

The modern timing of the change from preverbal to postverbal negation 
in the affected languages suggests comparison with the syntactic changes 
discussed in chapter 6. In the early modern period, while some languages 
elung to more archaic patterns, others opted for innovating variants. Here 
there is also a diseernible differenee between the standards and non-stan¬ 
dards. The latter, on the whole, tend to eliminate redundancy, but tolerate 
variation. Thus in the dialects of north Italy either non or the forclusif or 
both, may be permissible, according to context. The conservative stan¬ 
dards tend towards economy when this squares with etymology, so the pre¬ 
verbal NON negator is preferred but may be absorbed by a fronted negative 
element, and the use of ‘wrmQC&ssdity’ forclusifs is frowned on. 

The innovating standard, French, on the other hand, insists on the overt 
expression of both elements, whenever the verb is present. Comment has 
often been made about the ‘hypercharaeterizing’ tendency of standard 
French, with its insistence on clarity and overt expression of semantic ele¬ 
ments. In most French usage omission of ne is variable, depending much 
on region, register and socio-eeonomie factors, but especially on degree of 
education. Small children seem to learn the use of the negative without ne, 
but incorporate it into their usage, as a stylistic variant, at a late stage of 
acquisition. 

Note too that whereas older French often used a clitic ne(n) with a pronoun as 
a ‘no!’ answer to a question (nen-il, ne-jo ‘not-he’, ‘not-F > nenni), the modern 
language uses only the full stressed form non (or even, since the sixteenth cen¬ 
tury, non pas, with simply pas as a possibility in modern colloquial usage). 


(b) Relatives 

Another example of hypercharacterization in French is the distinction 
made between subject and object relative pronouns qui and que, whereas in 
the other Romance languages (including non-standard French) one form 
is used for both. 
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(75) French la femme qui est (ns. quest) venue / que jai vue\ Ital. la donna che e venu- 
ta I che ho visto; Rum. doamna care a vend / care am vdzut; Span, la mujer que 
vino I que vi ‘the woman who came / whom I saw’. 

There is a case difference in Engadine Rhaeto-Romance between [J'i] and [tfa], 
the former identical with the interrogative ‘who?’ and the latter with the com¬ 
plementizer. In standard Catalan qui may be used as a subject relative after a 
pronoun referring to a person, but in the colloquial language non-case-marked 
que, identical with the complementizer, is current. In Sard, as in many creoles, 
a non-case-marked [ki] form is used as relative pronoun and complementizer. 

There was a tendency to merge the Latin relative pronouns qui, quae, 
QUOD and the interrogatives QUis, QUis, quid. In the Romance interroga- 
tives a distinction is usually made between human: 

(76) QUI > Ital. chi, Sard kie, French, Cat. quv, quem > Rum. cine. Span, quien, 

Ptg. quem 

and nonhuman reference: 

(77) QUID > Ital. che, Sard ([v])i»e, French que. Rum. ce. Span, que, Ptg. (o) que. 

No such distinction is usually maintained for the relatives. 

The dative pronoun (< cui) survives in Rumanian as an interrogative 
and in Italian as a postprepositional relative form. In modern French it 
merges with qui in this latter function, with human reference: 

(78) I'homme d qui j'ai donne le livre I Ital. I’uomo a cui ho data il libro. Span, el 
hombre a quien dio el libro. 

In Spanish and Portuguese gender- and number-marked reflexes of the gen¬ 
itive singular cuius survive as both relative and interrogative pronouns: 

(79) cuyo, -a, -os, -as; cujo, -a, -o, -as. 

Alternative forms of the relative have been developed from qualis ‘of 
what kind?’, preceded, in the west, by the definite article which allows 
expression of gender and number: 

(80) Rum. care I Ital. il quale, R-R (Engadine) il quel, French lequel. Span, el cual, 
Ptg. o quai 

Central French relative qui is often seen as a mere surface variant of que, 
which is used as a universal complementizer in other dialects, and 
increased its scope in texts between the thirteenth and the sixteenth cen¬ 
turies. Qui is sometimes used only as a masculine animate subject pro- 
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noun, with que preferred elsewhere. The triumph of qui as a subject rela¬ 
tive has been linked with the non-pro-drop character of modern French 
(though, except for the Engadine, there is no parallel in other non- 
pro-drop Romance languages). The que > qui change (sometimes known 
jocularly as mas-QUE-rade) occurs not only when the relative functions as a 
subject, but also in examples with subject extraction, like: 

(81) I'homme que j’ai dit qui est venu I Ital. I’uomo che ho delta che e venuto ‘the man 
that I said came’; Qui as-tu dit qui est sorti de bonne heure? I Span. iQuien dijiste 
que solid tempranol ‘Who did you say went out early?’ 

Here the qui seems to stand for qu’il, which indeed is found in this position 
in early modern French texts and in the non-standard today; in older texts 
que was usual. As il was usually pronounced as [i], it could be that the 
spelling qu’il merely represents a mistranscription. However, there are par¬ 
allel examples with the feminine pronoun {quelle), which suggest that it is 
perceived as a combination of the complementizer que and a pronoun. 
This type of construction was condemned by seventeenth-century gram¬ 
marians, who preferred infinitival complementation (as in que fai dit etre 
venu see 3.4). In modern written style a different construction is advocated: 

(82) I'homme dont j’ai dit qu'il est venu ‘the man of whom I said that he came’. 

A similar curious use of French qui is found with the so-called ‘attribu¬ 
tive relative’ as in: 

(83) Je vois les enfants qui sortent de I’ecole / Rum. ti vddpe copii ca ies din sfoald; 
Ital. Vedo i ragazzi che escono dalla scuola'. Span. Veo a los nihos que salen de la 
escuela\ Ptg. Vejo os meninos que saem da escola ‘I see the children come out of 
school’. 

Here the qui acts more like a complementizer than a relative: 

(84) Je vois que les enfants sortent de I’ecole ‘I see that the children come out of 
school’. 

Medieval texts in the languages other than French also show sporadical¬ 
ly nominative case-differentiated relative pronouns, but it may be that the 
written usage was influenced by Latin. In the late medieval period in 
French there were signs of the spread of the use of relative que to subject 
function, but this was blocked at the time of standardization. 
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(c) Interrogative markers 

One consequence of the specialization of French relative qui as a subject is 
that interrogative que?, used for inanimates, cannot comfortably in the 
modern language be used as a subject. 

(85) *Que vans ennuie? ‘What is worrying you?’ 

has to be rendered: 

(86) Qii’est-ce qui vous ennuie? ‘What is it that is worrying you?’ 

where the inanimate interrogative que has to be bolstered by case-marked 
relative qui. Even with que? as an object the periphrastic form is more nor¬ 
mal in colloquial style: 

(87) Qiiefaites-vous? I Qu’est-ce que vousfaites? ‘What are you doing?’ 

This can again be regarded as an example of hypercharacterization. 

Modern est-ce que? can be analysed as an interrogative marker [esk], 
rather than an inversion of c’est que, which normally is used with senses 
like ‘That’s the reason that... ’ or ‘It’s the situation that... ’ 

In non-standard French qu’est-ce que, qui est-ce que can also be used as rela¬ 
tives. 

The development of the interrogative marker [esk] in French can be seen 
as an example of exaptation, by which use is made of an old form for new 
functions. It dates from the sixteenth century, when, as we saw (6.4), the 
Old French use of inversion of lexical subject and verb in polar interroga- 
tives had fallen into disuse. In WH- (QU-) questions such ‘stylistic inver¬ 
sion’ can still be used (except with pourquoi? ‘why?’), but is less usual than, 
and sometimes carries different implications from, so-called ‘complex 
inversion’, in which the lexical subject appears to be copied by a pronoun: 

(88) Quand votrefils est-ilmart? / Quand est mart votrefils? ‘When did your son die?’ 
(in which the latter would be insensitive, seeming to take the death for granted). 

Use of interrogative inversion of lexical subjects in polar questions was 
infrequent in Old French, while inversion of pronouns was usual. In such 
cases, unlike declarative sentences with inversion (triggered, for instance, 
by the V2 requirement), overt expression of the pronoun was frequent in 
questions. There are a few cases in the earlier period of dislocation of a 
lexical subject in a polar question, with a pronoun copy in postverbal posi- 
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tion, but it is not till the fourteenth century that such ‘complex inversion’ 
became dominant. In modem spoken French, as in most other Romance 
languages, polar questions differ from declarative sentences mainly by 
their intonation, but the standard language still dictates inversion of the 
subject pronoun, and, when a lexical subject is present, complex inversion. 
Non-inverted questions were rare in Old French texts (and difficult to dis¬ 
tinguish from emphatic statements), but they increased in frequency from 
about the fourteenth century. 

When the subject pronoun is je inversion is no longer used with -er verbs, and 
uncommon with others: 

(89) (older) Porte-je...? [poKtes] / Est-ce que je porte...? ‘Am I carrying... ?’ 

The change from use of simple to complex inversion can be seen as part of 
the trend to ‘direct’ (SVO) ordering in the early modem period. By the six¬ 
teenth century, an epenthetic -t- was inserted before inverted third person 
pronouns in interrogative and similar constructions: in popular usage the [ti] 
element (-t-il) became an interrogative marker. This remained in common 
use in popular French until recently, and is still used in Canada (especially in 
c'est-i. .. .^^ ‘is it... ?’, for est-ce ... ?). In Canada the modem pronunciation 
is [ty], possibly from confusion with personal pronoun tu. In more formal 
French the [esk] interrogative marker with no inversion is much used, 
though it is regarded as emphatic and rather clumsy in written style. 

The Latin interrogative markers ne, nonne, num left no trace in 
Romance. The WH- forms survive better, usually with no distinction 
between interrogative and other forms; besides QUi and quid (see above), 
QUANDO ‘when’, and ubi? or unde? ‘where?’, ‘whence?’ have reflexes in 
most languages: 

(90) Rum. ctnd, (Old) iu, unde; Ital. quando, (d)ove, (Old) onde; Sard kando, 
(Logudorian) ube, (Campidanian) undi; French quand, oil, (Old) ont; Span. 
cuando, (Old) o, (d)onde, Ptg. quando, (Old) u, (d)onde. (Swiss R-R cura? 
‘where?’ is a derivative of hora ‘hour’.) 

Cur?, quare? ‘why?’ are replaced in most languages by new derivatives: 

(91) Rum. pentru ce, Ital. percM, Sard (Logudorian) proittu, French pourquoi. 

Span, porque, Ptg. porque ‘for what?’ 

Quomodo? ‘how?’ survives everywhere in some form (in Sard and French 
with the adverbial suffix mente (see 2.3)). Quantus? ‘how much?’ is gener- 
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al, except in modern French, which has created a new compound 
combien?, from bene ‘well’, used as a quantifier (see bien des chases ‘many 
things’). 

Some non-standard languages have also developed new interrogative markers 
for polar questions: in Sard, for instance, sentence-initial [a], and in the 
Occitan of Gascony [e] (in this latter set of dialects declarative clauses are also 
usually introduced by a particle [ke]). In many North Italian and Rhaeto- 
Romance varieties interrogative markers have developed from enclitic pronoun 
forms (see 1.2). All of the Romance languages frequently use tags (usually 
‘yes?’ or ‘no?’) to emphasize polar questions. 

7.5 HYPERCHARACTERIZATION, SIMPLIFICA¬ 
TION AND EXAPTATION 

Simplification and elimination of redundancy seem to proceed in the 
opposite direction from hypercharacterization, which tends to increase 
redundancy, which in its turn may be reduced (as in the case of the drop¬ 
ping of French ne in non-standard usage, which relies on pas to convey 
negation). Alternatively it may favour functional differentiation of the 
redundant variants. Exaptation, or as Roger Lass graphically puts it, 
‘making things out of junk’, is found in the history of every natural lan¬ 
guage. The term in evolutionary theory was coined to refer to the co¬ 
optation, for new purposes, of struetures that were developed for other 
purposes but then became redundant. In this section we shall examine 
some examples of how these processes have helped differentiate the 
Romance languages. 

(a) Spanish imperfect subjunctive forms 

A remarkable feature of Castilian Spanish is the retention of two separate 
imperfect subjunctive forms (in -se and in -ra) for one function. The two 
forms in the standard language are stylistic variants, though the -ra form is 
preferred in some areas, especially America, and in expressions of polite¬ 
ness (like quisiera ‘I’d like’). In very formal written Spanish the -ra form 
can also be used with pluperfect indicative meaning (which was normal in 
Old Spanish, but archaic by the seventeenth century). 

The Latin imperfect subjunctive (e.g. cantarem) apparently survives 
only in Sard and Portuguese (see 4.3 for the ‘personal infinitive’). 
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Normally the pluperfect subjunctive (e.g. canta(vi)ssem) moves into its 
slot as a past subjunctive, though in Rumanian it is used as a pluperfect 
indicative, and a compound form with a fi ‘to be’ is used for the past sub¬ 
junctive (see page 17). In Rhaeto-Romance varieties the forms are used as 
‘conditionals’, a function that in other western languages is filled by 
future-in-the-past forms, apparently grammaticalized from infinitive -t 
habere forms in the imperfect or imperfect (see 4.6): 

(92) French chanterait. Span., Ptg. cantaria < cantare + habebat; Ital. canterebbe 
< CANTARE + HABUIT. 

In Rumanian the origin of the auxiliary in the conditional suggests a form of 
*volere, as in the future, or a past subjunctive of habere: 

(93) cintaras -ai, -ar / ay, ai, ar cinta. 

However, the Latin pluperfect indicative (-ra) forms (canta(ve)ram) 
appear early to have taken on modal functions, especially in conditional 
clauses. This latter use survives especially in southern Italy, but it is also 
found in Old Tuscan and Old Catalan and Occitan texts. In modern stan¬ 
dard Catalan some forms of common verbs can still be used instead of the 
more usual conditional: 

(94) fora, haguera for seria, hauria ‘I would be, have’. 

In non-standard Argentinian Spanish, on the contrary, the conditional form 
can substitute for the -ra subjunctive. 

In the other Iberian languages -ra forms moved further into the territory 
of the subjunctive. In the Catalan of Valencia they ousted, by the eigh¬ 
teenth century, the -se subjunctive forms. Whereas in standard Spanish 
both forms subsist as equivalents, sociolinguistic surveys show that, in 
Seville for instance, the -ra forms are far more frequently used. In Castile, 
on the other hand, the -se forms are current. 

For the pluperfect indicative the cantaveram forms are usually replaced in 
Romance by the past form of the ‘compound past’ (see 3.3), with the auxiliary 
in the ‘imperfect’ tense. They survive precariously in modem formal written 
Portuguese, and even more marginally in Spanish (see above). There are exam¬ 
ples in old texts, especially in Sard, and the very earliest French texts seem to 
use cognate forms as a past punctual tense. 
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(b) Ser and estar 

The Iberian languages show an interesting instance of hypercharacteriza¬ 
tion that is unknown to the other Romance languages. This is the develop¬ 
ment of two separate words for the copula {‘atributivo’) / auxiliary ‘to be’. 
The distinction between ser {esser in Catalan) and estar forms causes some 
confusion to speakers of other Romance languages, especially as their use 
is not identical in the three languages. 

In all the medieval Romance languages there was some tendency to con¬ 
fuse the inherited ‘to be’ (esse) forms with those of SEDERE ‘to sit’, in par¬ 
ticular in the future and the present subjunctive forms. In Spanish and 
Portuguese older seer completely merged, in the early modern period, with 
ser, and a derivative of sedere (sentar) provides the modern word for ‘to 
sit’, but in Catalan seure (< ’"SEDfiRE?) is clearly distinguished in the mod¬ 
ern language from hser. 

Although {es)ser remains the most common be form, there has been 
invasion of its traditional territory from medieval times by stare ‘to 
stand’. In 4.5 we saw that this verb is widely used with the present partici¬ 
ple to convey the present progressive (‘is doing’); it is also used in a 
periphrastic immediate future ‘to be on the point of’: 

(95) Span., Ptg. estar para canlar. Cat. estar per cantar, Ital. sta per cantar. 

Participial forms, in particular, of stare are frequently incorporated sup- 
pletively into the esse paradigms: 

(96) French etant < stante, Ital. state < statu 

while as a full lexical verb in its Latin meaning it is often replaced by other 
expressions. 

In Rumanian a sta can mean ‘to sit’ as well as ‘to stand’, whereas the usual 
word for ‘to sit’ a sedea can mean ‘to stay’, like the stare forms in other lan¬ 
guages. 

In the Iberian languages, the two be forms are differentiated, except in the 
Catalan past participle: 

(97) estat / Span. Ptg. side, estado. 

Although their function was not always clearly distinguished before the 
modern period, in the modern standard languages they are said to differ 
semantically broadly along the lines of ‘permanency’ versus ‘changeabi- 



How did the Romance languages differentiate? 313 

lity’, reflecting, it is claimed, the etymological distinction between ‘sitting’ 
and ‘standing’. Individual uses do not always, however, fit smoothly into 
this pattern, and there is considerable variation lexically, regionally and 
between speakers. This is particularly so in Catalan. 

The morphological passive requires the auxiliary, whereas the use 
of estar with the past partieiple implies stative, resultative adjeetival func¬ 
tion. (^Es)ser is used with noun complements and estar, with locative 
adverbial complements (in Portuguese and Catalan, and probably in early 
Old Spanish, only with animate subjects). With adjectives it is not always 
easy to predict the choice of copula: sometimes ser and estar are optional 
alternatives and sometimes they signal different uses of the adjective. 

Castilian seems to have been most consistent in the spread of estar to 
more eontexts, at the expense of ser. The late and hesitant movement of 
Catalan in this direction suggests influence from Castilian, though it is 
also argued, because of parallels in Occitan, that the development was 
early and independent of the other Iberian languages. Portuguese does not 
show evidence of influence from Castilian, except in so far as the early 
development of both languages went hand in hand. Modern Portuguese 
also uses ficar (originally ‘to fix’), apparently as a substitute for estar, a 
similar use of fincar (modern hincar) was found in older Spanish, replaeed 
today by quedar ‘to stay’. 

It is tempting to link the creation of two be verbs with the loss of the use 
of ser as a perfect auxiliary in the Iberian languages during the early mod¬ 
ern period, and with the reduced use of the be passive in these languages 
(see 4.7 for the se passive). Another factor that may have played a part is 
early loss in the Iberian languages of full lexical meaning of estar, whereas 
in Old Italian and Old French texts the cognate form is used for ‘to stand’ 
and ‘to stay’. Yet another factor is the extreme irregularity (with supple- 
tion) of the {es)ser forms, whereas estar fits into familiar morphological 
templates and may therefore have extended its distribution. We have 
already seen that Catalan and Spanish tend particularly to favour analogi¬ 
cal remodelling. 

Note here that in Gascony the estar stem has been introduced even into the 
simple past of the ‘to be’ verb: 


(98) 


esteilestoui for houi < fuT. 
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(c) Comparatives 

Reduction of redundancy can lead to differentiation between languages 
that choose to perpetuate different alternative forms. An example can be 
found in expressions of comparison of inequality in Romance. In Latin a 
nominal complement (or target) of the comparison could be introduced 
by the conjunction quam, or it could appear in the ablative case. In the lat¬ 
ter (‘phrasal’) construction the nominal can be interpreted as the location 
from which the comparison starts (‘from A, B is -er’). In the former case it 
looks more as if there has been ellipsis of the verb phrase in a complement 
clause, with the quam acting as a link particle, or pivot, like English than 
(‘B is -er than A is -’). 

In later Latin the genitive case sometimes replaces the ablative in the 
comparative phrasal complement. All the Romance languages continue to 
some extent this use, with the preposition de acting as a case marker. The 
clausal introducer quam survives as a distinct particle in some Romance 
languages (especially in older texts), but the comparative clausal link is 
most often effected by what looks a WH- (QU-) complementizer, appar¬ 
ently introducing a subordinate clause. This coincidence of forms suggests 
that there has been movement, as in relative clauses, of part of the comple¬ 
ment clause, rather than ellipsis. In virtually all the languages, at some time 
during their history, the QU- complementizer was used regularly to intro¬ 
duce a complement clause with an overt finite verb. 

In Romance, analytic constructions for most adjectives replace the Latin syn¬ 
thetic comparative forms (magis used in the Iberian peninsula and Rumania, 
and PLUS elsewhere). 

In most languages both introducers {de or QU-) are found in old texts, 
sometimes apparently interchangeably. There is also some limited exten¬ 
sion of the QUOMODO particle used in comparison of equality into the 
realm of comparison of inequality. Though the complementizer normally 
introduced a complement clause, it was, however, sometimes omitted in 
older texts, in the presence of an ‘expletive’ or ‘pleonastic’ negative ele¬ 
ment, used perhaps to accentuate the differenee between the two compar- 
ands (‘B is -er than A is not -’). Omission of the complementizer was most 
frequent with an overt verb in the subjunctive (‘B is -er than A may not be -’). 
In Old Italian we find examples like: 

(99) ... pin suave donne non faccia Signore ‘... he sleeps more sweetly than a 

Nobleman would do’ 
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where che would now be required. These can be paralleled by French 
examples without the subjunctive like: 

(100) ... plus frois nefu.‘... colder than it was’. 

Such examples may, however, possibly stem from poetic licence, and are 
absent from the very earliest texts. A parallel may be drawn with the not 
infrequent medieval omission of the complementizer, when the combination 
of the negator and the subjunctive unambiguously signals subordination. 

It would appear that in the earlier texts in some languages use of the 
subjunctive and of negation in the complement clause were alternatives: 
possibly the former implied a more tentative judgement of comparison. In 
some languages, though, the negator has become almost a marker of clausal 
comparison. From early Old French negation in the complement clause 
was regular, except when there was already negation in the principal clause, 
and the subjunctive was infrequent. In Italian, however, use of the negative 
and subjunctive became complementary, though some variation persists: 

(101) Maria epiu intelligente che non sia Gianni ‘Mary is more intelligent than John 
might [not] be thought to be’. 

In modern standard French the use of the ‘expletive negative’ ne in com¬ 
plement clauses of comparatives is regarded as elegant but optional. In 
older French a full negative was frequent, and even in the seventeenth cen¬ 
tury we find examples like: 

(102) Vous avezplusfaim que vous ne pensez pas ‘You are hungrier than you [don’t] 
think’. 

Note that whereas today a pro-adjective is virtually mandatory in standard 
French; 

(103) Elle est plus intelligente que Jean ne Vest ‘She is more intelligent than John is 
[not so]’ 

this was not true in older French, where ellipsis, rather than pronominalization 
of the adjective was regular. 

In Spanish texts, too, before the seventeenth century, a negative could be 
used in a finite complement clause depending on a comparative, though 
this usage appears to have been less frequent than in French or Italian. A 
negation marker appears in the modem standard only when the link 
particle que would be followed immediately by complementizer que\ 
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(104) Mas vale que vengas que no que quedes ‘It is better that you come than [not] that 
you stay’. 

In less prestigious speech no is also frequently used in phrasal complements: 

(105) Elperro es mas feroz que (no) el gato ‘The dog is fiercer than [not] the cat’. 

Catalan, on the other hand, optionally uses no (pas) after a que introduc¬ 
ing a finite verb: 

(106) Dona mes que (no) promel ‘He gives more than he [doesn’t] promise(s)’. 

It appears that the negative is not normally used after a de introducer. In 
Rumanian there are, however, examples of polarity items being used in the 
older language: 

(107) ma di^teptu di nis / modem mai de^tept deal dmsul ‘more intelligent than no- 
one/anyone’. 

In modern Italian there has also been analogical spread of the negator from 
cite to di quanto constructions (see below). 

In modern French the link particle que has spread to most comparative 
constructions at the expense of de, which lost ground in the early modern 
period, and is now confined to very few types of comparison. In modern 
Spanish and Catalan, too, the que particle is used with a nominal comple¬ 
ment, where modern standard Italian prefers di (though che is usual in 
Northern varieties, and a sequence of both forms is also attested): 

(108) plus haul que moi\ mas alto que yo; mes alt que jo / Ital. piu alto di me, Sard 
(Nuoro )prur artu 'e mimmi ‘taller than me/V. 

Portuguese in the same context may use que but prefers a combination do 
que, with de followed by a pronoun and relativizing complementizer: 

(109) mais alto do que eu. 

The same sort of sequence introduces a finite clause preferentially in 
modern Spanish and Portuguese, and more optionally in Catalan: 

(110) mas alto de lo que parece\ mais alto do que parece', mes alto del que (Ique no) 
sembla ‘taller than he seems’. 

In Italian (though apparently not in Sard) similar sequences of di + pro¬ 
noun / quantifier + relativizer are preferred in modern colloquial speech to 
introduce a finite complement: 
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(111) piu alto di quanto (non) creda / di quel che pensavo ‘taller than he believes / than 
I thought’. 

The Italian di quanto sequence is etymologically cognate with the standard 
Rumanian link particle decit: 

(112) mat inalt decit mine ‘taller than me’. 

In phrasal, though not clausal complements, on the other hand, some vari¬ 
eties of colloquial Rumanian prefer the complementizer-like introducer ca. 

Thus what originally looked like a contrast between a phrasal and a 
clausal complement of the comparative, the former introduced by de and 
the latter by a QU-complementizer, has been obscured in the modern lan¬ 
guages, which have generalized along different lines. The QU- option has 
been most widely generalized in French, and the de option most in Italian. 
In the Iberian languages the use of the nominative pronoun, as in (108) 
and (109) above, hints at reduced sentential status for the target of the 
comparison, as distinct from the phrasal status in French, Italian, Sard 
and Rumanian, where a postprepositional pronoun form is used. Yet in 
modern Spanish a clausal comparative complement is not usually intro¬ 
duced by the simple complementizer que, but rather by de + pronoun + rel- 
ativizer que. 

The differences in usage between the languages seem to have developed 
only during the modern period: the evidence of medieval texts suggests 
retention of the Latin difference between phrasal and clausal comple¬ 
ments. Later, however, there has been confusion between the two, resulting 
in a degree of redundancy, and differential development, with French, 
Spanish and Rumanian reducing the redundancy most radically. 

(d) Pronouns / determiner forms 

One simple example of exaptation in Romance is the specialization of 
pronominal forms that were originally phonologically determined into differ¬ 
ent morphological categories. We have already seen (in 3.3) that the deter¬ 
miner systems of Romance were to a large extent developed out of the Latin 
pronominal system. It is surprising that so much ‘bricolage’ (or ‘do-it-your- 
self’) was applied to the Latin distal demonstrative pronoun ille set of 
forms, while other sets were abandoned. In one and the same language - like 
French - ille provides the stuff from which new categorial sets were tinkered. 
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Subject and object pronouns, the definite article and the demonstrative sys¬ 
tem, not to mention the word for ‘yes’, all have drawn on the one source. 

The formal differences that are today associated with distinct functions 
can often be traced to phonological contextual variants: thus clitic person 
3 subject and free-standing feminine pronoun elle and feminine clitic 
object and definite article la originate respectively as clause-initial, with a 
vowel onset, and proclitic, with initial vowel elision, variants of the same 
Latin form illa. In Italian the masculine singular definite article il, with 
vowel onset, still has a variant lo, identical with the object pronoun, used 
before s'-clusters that require the support of a vowel to aid syllabification 
(see 7.2 for prosthetic vowels). 

In Spanish the equivalent article form el, identical (except for a graphi¬ 
cal accent) with the subject pronoun, is differentiated functionally from lo, 
the ‘neuter’ article, which in its turn is formally identical with the object 
pronoun. Although it can be claimed that the modern forms perpetuate a 
Latin case distinction, between ille and illum/illud, there is no evidence 
for direct line of descent, rather than the effects of phonological accident. 
In French and Italian person 3 subject pronouns il and egli remain distinct 
from the article forms and cannot be explained in terms of Latin case. The 
final [i] which subsists in Italian and must have had metaphonic effect in 
French can be accounted for only by postulating analogical influence from 
the interrogative/relative Qui. 

The familiar story applies here. Whereas in early texts, phonologically 
contextual forms of ille were not clearly functionally differentiated, by 
the early modern period they came to be assigned different roles and sight 
was lost of their phonological motivation. However, what would have 
become meaningless ‘junk’ was recycled in rather different ways by the 
diverse languages, in the formation of new systems unknown to Latin. 


(e) Past participle and adjective 

Another example of exaptation is the way analogically regularized past 
participles have differentiated from inherited irregular forms semantically 
and categorially. This is most evident in Spanish, which has (like 
Portuguese) standardized its regular past participles into two conjugation 
types -ado and -ido. The earlier -Otu ending, which persists in other lan¬ 
guages, has usually been replaced in the modern language by -Ixu (to such 
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an extent that it has been suggested that the u form was never truly 
Castilian, but an Occitanism, or a Leonese or Aragonese form): 

(113) Span, tenido, Ptg. tido / Cat. tingut, Ital. tenuto, French tenu. Rum. tinut (but 
Sard tentu). 

Analogically reformed participles, used for instance in the compound past 
or the passive, exist alongside older ‘strong’ forms with only adjectival 
function: 

(114) Span, suspense ‘hanging’ / suspendido ‘hung’, confuse ‘confused’ / confundido 
‘confounded’,y««to//untado ‘joined’. 

In Portuguese, on the other hand, frequently both ‘strong’ and ‘weak’ 
forms survive as stylistic variants, though the former is less often used with 
verbal function: 

(115) divisoldividido', rololrompido; surpresolsurprendido\ junto!juntado. 

Past participial doublets are not unknown in other languages, usually with 
some stylistic or regional variation: 

(116) Ital. \istolveduto; persoiperduto. 

7.6 LEXICAL AND SEMANTIC DIFFERENTIATION 

In chapter 2 we suggested that shared lexicon was what identified the 
Romance languages. This may be true at the level of basic lexicon; yet 
there is conscious perception of more subtle lexical differences within 
Romance: there are few Italians or Spaniards who have not sniggered over 
burro (‘butter’ in Italian, but ‘donkey’ in Spanish). 

’"BOrriccu ‘mule’ (Classical bOrrIcu ‘pony’) rivals in the Iberian languages 
ASINU ‘donkey’. In Latin America it has acquired a host of different meanings, 
relating to plants, perhaps via the Portuguese use to mean ‘saw-bench’ (Span. 
caballete). Daco-Rumanian uses for ‘donkey’ the ‘substrate’ word mdgar, 
though Arumanian preserves asin. Butyru, of Greek origin, entered Italian as 
burro probably via borrowing from French (see also butirroso ‘buttery’). The 
Rumanian word for ‘butter’ unt (UNcru ‘ointment’) is paralleled in standard 
Italian (as elsewhere) in the meaning ‘grease’, and in Northern varieties for 
‘butter’. Whereas Old Catalan used bure for ‘butter’, the modem language has 
borrowed mantega from Spanish (see 6.3). 

Whereas phonological differentiation can be ignored or overcome, seman- 



320 


The differences 


tic or lexical differentiation is part of the conscious assertion of identity, of 
‘otherness’, of a linguistic community. An analogy with wine is one most 
Romance speakers would appreciate. Slight variations in weather, soil, 
maturation, balance of grape varieties, or wine-making techniques can 
effect quite dramatic changes to the taste (and aroma) of the wine: yet the 
connoisseur can still identify its provenance, its personality, and attempt to 
analyse the impact of the variables on the final product. Study of lexical 
variation in Romance has something of the same fascination, and necessi¬ 
tates the same sort of meticulous attention to detail and to multiple causa¬ 
tion. Lexicological and onomasiological studies are particularly popular 
among Romance scholars. ‘Butterfly collection’ it may be, from the point 
of view of the dedicated theorist, but the sheer wealth and wondrous vari¬ 
ety of data cannot fail to hold in thrall those whose interest lies in lan¬ 
guages as they are used in the real world. 

Here we can only glance briefly at one or two examples of how different 
Romance languages have differentiated lexically. In each case the differentia¬ 
tion need not always hamper communication, as frequently the usual word 
in one language may be tagged by another form that differs only slightly in 
meaning, or prestige. We have already seen how the Romance languages 
have borrowed from substratum and superstratum languages, from each 
other and from Latin. In this section we shall consider a few examples of 
how Romance languages have called in different lexical forms to express 
similar meanings, and how they have assigned different meanings to formal¬ 
ly similar lexical items. The illustrations are chosen by virtue of the complex¬ 
ity of their interaction and their emotive connotations. It will be noted that 
sometimes the etymology is uncertain and the semantic shifts obscure. 


(a) Verbs 

The Romance languages provide rich material for the study of lexical and 
semantic fields and interaction between different forms referring to activi¬ 
ties, events and sensations. The following list illustrates with the kind of 
examples frequently studied by Romanists. 

Verbs: some differences between Romance forms 

Killing: Latin interficere ‘to do away with’ did not survive into Romance, but 

occlDERE ‘to cut down’ was current in the medieval languages and survives in 
Italian, Sard and Occitan as the normal word. Necare is a general word for ‘to 
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kill’ in Rumanian dialects south of the Danube, but means ‘to drown’ in most 
Romance languages and ‘to infest with plague’ in some Italian dialects. French 
tuer deserves special mention. Until the seventeenth century, and still in 
regional varieties, it was used for putting out a fire. Its etymological origin 
(*TUTARE ‘to take care of’) suggests Mafia-like connotations. Some believe its 
modem meaning arose from damping down a fire (so as to stir it up later), oth¬ 
ers that it generalized from expressions like ‘to slake (and thence to get rid of) 
thirst’. The Iberian matar words may derive from mactare ‘to slay, smite’ or 
from the Persian word that gives us ‘(check) mate’, or perhaps *mattu 
‘insane’. Rum. a omort{l a ucide) is of Slavonic origin. 

Stopping, staying, closing, leaving: Latin (RE)MANERE,’to stay’ is known in most languages, 

but by the seventeenth century Old French remaindre was replaced by the early 
Latinism tester < restare ‘to stand behind, stand still, be left’ (also Ital. 
(ar)restare, French arreter ‘to stop’). A Latin derivative ’"quietare ‘to be quiet, 
to calm’ (cf Ital. quietare) gives Spanish quedar ‘to stay’ and French quitter ‘to 
leave, take away’. Parare ‘to get ready’ in Italian means ‘to shield, block, stop, 
defend’, in French ‘to ward off’ but in Spanish ‘to stop’. The Italian normal 
word for ‘to stop’ is fennare from the Latin word ‘to make firm’, which 
becomes the French word for ‘to close’ (replacing clore < claudere in the sev¬ 
enteenth century). In Spanish the Latinism firmar means ‘to sign’, a develop¬ 
ment that can be paralleled semantically perhaps by Vtg.fechar ‘to close’, 
which seems to be related to the Spanish word for ‘date’, from the closing for¬ 
mula of medieval documents < facta (‘things done’, cf modem Span, hecho 
‘done’). The normal Spanish word for ‘to close’ cerrar may be derived from 
sera ‘lock’ (in Italian and French its cognates mean ‘to squeeze in’, with the 
related noun used for ‘dyke’ or ‘greenhouse’ etc.). Claudere ‘to close’, known 
in nearly all the medieval languages, survives in Ital. chiudere. Rum. a inchide, 
but Catalan and Occitan prefer tancar, which may relate to a form *stanti- 
care, which in Occ., Cat., Span, estancar and French etancher primarily 
means ‘to stem the flow of water’ (cf French etang ‘pond’). 

Tricking and duping: DEcipere ‘to catch out, deceive’ survives into some medieval languages, 
taking on the meaning ‘to disappoint’ in modem French. The more usual 
words, however, seem to come from *ingannare, implying ‘mockery’, formed 
from GANNiRE ‘to yelp, snarl’: Ital. ingannare, R-R (Engadine) ingianare, Occ. 
inganar. Cat. enganyar. Span, enganar, Ptg. enganar. French (and hence 
Occitan, Catalan) uses, from the fourteenth century a similar metaphor ‘to 
make game of’ from ‘to play the trumpet’ (tromper). Rum. a tn§ela may be 
derived from sella ‘saddle’ (the same form / a in§aua, means ‘to saddle a 
horse’), using a different metaphor, though some seek its origin in Bulgarian. 

Wanting, seeking, asking for, finding: ’"Volere ‘to want’ has been lost, and amare ‘to love’ 
has declined in Spanish and Portuguese, replaced by querer < quaerere ‘to 
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seek, ask for’. Logudorian Sard uses kerrere in the same way, borrowed per¬ 
haps during the Spanish occupation of Sardinia. Quaerere supplies the usual 
Italian word for ‘to ask’ (chiedere apparently with dissimilation of intervocalic 
[r] to [d]), and the older French words for ‘to seek’ (querre, then querir). ‘Ask’ is 
rendered in Spanish, Portuguese and Logudorian Sard by percontare (pre- 
gunlar(e), perguntar), and ‘seek’ in Spanish and Portuguese by buscar, appar¬ 
ently derived from a Germanic form for ‘bush, wood’. The putative link with 
hunting is reflected also in the more widespread Romance word for ‘to seek’, or 
originally ‘to search’, which is derived from circare, used for ‘to encircle’ or 
‘to go round a place hunting for something’. In Italian it seems very early to 
have acquired the ‘seek, fetch’ connotations, but in French it did not definitive¬ 
ly replace querir until the early modem period. The modern French form 
chercher, with assimilation of the initial to the medial consonant, compared 
with the older form reflected by the English borrowing search, may owe some¬ 
thing to the influence of Italian. Rum. a cauta may also be derived from hunt¬ 
ing vocabulary (< captare?, which also supplies words for ‘to seek’ and ‘to find’ 
in varieties of north Italy). Other Romance words for ‘to find as a result of 
seeking’ may also linked to hunting metaphors: aflare ‘to sniff out’ seems to 
be at the root of Span, hollar, Portuguese achar and Rumanian a afla, whereas 
the rather mysterious French and Italian words trouver, trovare may originate 
as a fishing term, with turbare referring to stirring up the water (in Sard it 
means ‘to urge on, to drive, beasts of burden’). Others, however, seek the origin 
in poetic invention via tropare, formed on the word for trope. ‘To ask insis¬ 
tently, to beg, pray’, derives in most modem languages from precare, but 
ROGARE, which is known in most older languages, survives in Spanish and 
Portuguese (rogar), in Rumanian (a ruga) and in Rhaeto-Romance (Engadine 
ruvar). 

Working-. Operare has tended to specialize in the sense of artisanal achievement, where¬ 
as LABORARE ‘to toil. Suffer’ (except in Italian) has moved over exclusively to 
agricultural activity, especially ploughing. (The Argentinian Spanish expres¬ 
sion Que labura! ‘What a chore!’ is borrowed from Italian.) The most wide¬ 
spread verb for ‘to work’ is derived from an instmment of torture with three 
spikes, tripalium: French travailler. Span, trabajar, etc. (in English the ‘tra¬ 
vail’ is associated with moving around - ‘travel’). Rum. a lucra < lucrare ‘to 
gain, profit’ is associated with reward, though the frequent but more restricted 
Slavonic word a munci has ‘painful’ as well as political connotations. In the 
same spirit of seeing work as painful we should perhaps note the specialization 
of French chomer, derived from the Greek cauma ‘heat’, originally ‘to rest, to 
keep calm’, in the sense of ‘to be unemployed’ (cf the loanwords Rum. corner, 
^maj etc.). In Italian we have the transparent disoccupato, senza lavoro for 
‘unemployed’, and the Iberian languages use the participial form of parar ‘to 
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stop’. The idea of withholding labour as a weapon against employers is a mod¬ 
em one, and it is not surprising that words for ‘strike’ differ: in nineteenth- 
century French, greve, in this sense, derives from the gravelly site on the banks 
of the Seine where workless labourers gathered in hope of being hired (Rum. 
grevd is borrowed). Catalan vaga comes from vacare ‘to be empty’ (cf. Span., 
Ptg. vagar ‘to be idle’). Span, huelga, Vtg.folga, derived ultimately from FOLLE 
‘bellows, windbag’, suggest a similar semantic evolution; note that the 
Andalusian form juerga entered the standard in the nineteenth century in the 
sense of ‘binge, spree’. Ital. sciopero (< *exoperare ‘to stop work’) also has 
emotive overtones (cf. e.g. scioperone ‘shirker, fool’). 


(b) Nouns 

There is less interaction between nouns in Romance, especially when they 
refer to concrete things. Often one Latin word has survived in a number of 
languages, but sometimes there is a maverick form in others. Here are a 
few examples from common concrete nouns. 

Some everyday Romance nouns 

Cheese: CASEU gives most Romance words, but French frontage (borrowed as Ital.ybr- 

maggio. Cat. formatge) refers to the shape (forma) of the cheese. Rum. brinza 
is of uncertain origin. 

Dog: Cane gives the usual Romance words, but the isolated Spanish perro, and 

Catalan gos, which has now obsolete parallels in several other languages, may 
be onomatopoeic. 

House: From casa ‘hut’ nearly everywhere, but the French form chez has become syn¬ 

tactically restricted to locative prepositional status (‘at the house of’; similar 
forms are found in north Italy) and is replaced by maison < mansione ‘stay’, 
hence ‘place one stays’ (in other Romance varieties ‘cattle-shed’ etc.). Domus 
‘house, home’ survives in the same sense in Sard, but Italian duomo has special¬ 
ized as ‘cathedral’ (borrowed as French dome). 

Meat: carne survives everywhere, but modem French chair means ‘flesh’ (cf also 

faire bonne chere ‘to eat well’, which may involve cara ‘face’ or carne ‘meat’). 
From the early modern period, viande is used in French for edible meat, pene¬ 
trating also to other languages. It derives probably from vivenda ‘things neces¬ 
sary for life’, which is used for a variety of foodstuffs in other Romance vari¬ 
eties (cf Span, vianda, from the French, ‘food’ / vivienda ‘house, dwelling’; 
Ital.vivanda ‘dish of food’). 

Paper: The Egyptian plant name papyru supplies the most widespread Romance 

words (the Iberian forms are probably borrowed from French papier). Rum. 
papurd ‘bullrush’, Spavt. pabilo ‘candle-wick’ are nearer the original meaning. 
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Greek charta ‘paper’ is reflected in Ital. carta, but more widespread meanings 
are ‘charter, letter’ (French carte ‘card’ is a fourteenth-century Latinism). 
French lettre ‘letter, written communication’ derives from the plural LITTERA 
‘letters’, whereas the equivalent Rumanian word scrisodre is a derivative of 
SCRIBERE ‘to write’. Rum. hirtie ‘paper’ is borrowed from a Slavonic word, itself 
probably borrowed from Greek charta. 


On the whole, parts of the body retain in Romance their Latin names 
(see page 91). However, popular shifts of meaning are frequent, resulting 
in some notable variation between the languages. The following list gives 
some examples. 

Parts of the body 

Belly. Venter gives popular words everywhere, but pantice ‘paunch’ rivals it in most 

languages. Span., Ptg. barriga (from the fifteenth century) is from the word for 
‘barrel’ (seventeenth-century barrica is borrowed from Gascon). Rum. burtd 
(Ipintece) is of unknown origin. 

Face: ORE (also ‘mouth’) survives nowhere. Facie (‘form’) remains a primary word 

in Italian and Rumanian (also in older French). Vultu (‘appearance’) gives 
Ital. volto ‘countenance’ (also older forms like Old French vout). Greek cara is 
a widespread form (Span., Cat., Ptg. cara. Old French chiere, also in north 
Italy). Visu ‘appearance’ is also common: Old French vis was replaced by 
derivative visage, which in the modem language is rivalled by more popular fig¬ 
ure. Span., Ptg, rostro. Cat. rostre is from rostru ‘beak’ (words normally used 
for animals are often slangily used for the human face in other Romance lan¬ 
guages). Rum. chip is from Hungarian. 

Hand: Manus nearly everywhere, but branca ‘paw’ is also current in Rumanian, 

Sard and Rhaeto-Romance. 

Head: Caput and its derivatives remain common, but in French chef retains only fig¬ 

urative meanings. Testa ‘egg-shell’ became the usual word in French (bor¬ 
rowed as ‘head’ into numerous other varieties, and in others still meaning 
‘skull’, ‘nutshell’, etc.). 

Jaw: Maxilla gives Ital. mascella. Cat. maixella ‘jaw’. Rum. indsea ‘molar tooth’ 

and Span, mefilla ‘cheek’. Late twelfth-century French introduced the deriva¬ 
tive mdchoire from mdcher < masticare ‘to chew’, which ousted Old French 
maisselle. Ital. ganascia ‘jaw’ derives from the Greek. Ptg. queixo ‘jaw, chin’. 
Cat. queix ‘cheek’ ultimately derive from CAPsa ‘box’: Span, quijada is today 
used for an animal jaw, while the eighteenth-century Latinism mandibula refers 
to humans. Kum. falcd ‘jaw’ is said to be from Latin falcis ‘sickle’ - a meaning 
shift that is apparently found also in Albanian. 

Knee: Genuculu, diminutive of genu ‘knee’, is usual but medieval Span, hinojo 
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(which had become identical with the reflex of fenuculu ‘fennel’) was 
replaced by rodilla ‘knee cap’, a diminutive of rota ‘wheel’. 

Leg: Latin crure does not survive. Span, pierna. Cat., Ptg. perna continue Latin 

PERNA ‘ham’, whereas elsewhere Greek camba ‘joint, leg’ is preferred (Cat. 
cama, Occ., Sard camba, R-R (Engadine) chammd). Ital. gamba, French jambe. 
Rum. gamba (‘shin’), R-R (Surselvan) tgomha derive from a variant gamba, 
attested in fourth-century Latin texts. 

Shoulder: Humeru is widespread but French epaule derives from Greek spatula ‘flat 

sword’, which also gives Span, espalda ‘back’ and Ptg. espadua ‘shoulder-blade’. 

(c) Adjectives 

Common adjectives again usually are fairly uniform over the Romance 
area, but there are apparently random variations here as elsewhere, espe¬ 
cially where emotive factors intervene. 

Some adjectives 

Colours (two examples): Yellow: croceu ‘saffron’ survives only in Cat. groc and Sard grogu 
(cf. Old Ital. gruogo). Span, amarillo, Ptg. amarelo are derivatives of AMARU 
‘bitter’. Galbinu ‘greenish-yellow’ gives Rum. galben. Old French jalne (mod¬ 
em jaune), which was borrowed into other languages, notably Ital. giallo. 

Red: derivatives of ruber give Sard ruyu, French rouge. Cat. roig (cf Span. 
rubio ‘blond’), RUFU gives Cat. ruf and russu/russeu Ital. rosso. Cat. ros, 
French roux I Rum. roju, Span, rojo, Ptg. roxo. Cat. vermeil etc. are from ver- 
MICULU ‘grub’ (which provides vermilion). 

Dirty: From spurcu we have Rum. spurcaf, Ital. sporco seems to be crossed with 

PORCU ‘pig’, which also gives Span, puerco, Ptg. porco. The other common 
word for ‘dirty’. Span, sucio, Ptg. sujo. Cat. sutze, is from ’"suciDU ‘Juicy’ (cf 
also Ital. sucido, a Latinism, ‘grease’ / sudicio ‘dirty’/ sozzo, from Occitan, 
‘filthy’ / Old spurcido crossed with sporco). French sale is of Germanic origin; 
older words were put < putidu ‘stinking’ (cf modem putain ‘whore’), and ord 
< HORRIDU (cf Ital. dial. (Tuscany) ordo ‘dirty’, modern French ordure 
‘garbage’). The usual Catalan word for ‘dirty’ is brut (cf Ital. brutto ‘ugly’). 
Rum. murdar is of Slavonic origin. 

Mad: INSANU, DEMENTE do not survive as popular words. Folle ‘bellows’ gives 

words in Italian, French, Occitan and Catalan, though Cat. boig < balbiu 
‘stammering’ is also current. For Ital. matto, cf Span, matar ‘to kill’, and pazzo 
probably derives from patiens, which has in Latin the meaning ‘stubborn’. 

Sard mdkku seems to come from maccu, a name for a clown, and Rum. nebun 
is derived from bonu ‘good’. Other words are of more uncertain origin: Span. 
loco, Ptg. louco, for instance, have variously been derived from uluccu ‘owl’. 
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or the personal name Glaucu. 

Thin: Macru is the usual word but more evaluative ‘slim’ in some languages derives 

from a word connected with minutu; French, Occ. mince. Cat. minso. Catalan 
and Occitan also use prim < primu. In Rhaeto-Romance subtilu (Engadine 
stigl, Surselvan satet) is the preferred word. Ital. snello, also ‘nimble’, is from 
the Germanic word for ‘quick’ (cf Old French isnel), while more elevated esile 
< EXILE is probably a Latinism. 

Ugly. Foedu survives in Sard feu, Span.^eo, Vig.feio. French laid. Cat. lleig are 

Germanic borrowings. Cat. Hard (like French lourd ‘heavy’) is from luridu 
‘pale yellow, ghastly’. Modern French vilain was in the Middle Ages ‘a peasant, 
a villein’ from villanu ‘inhabitant of a villa’ (cf also English villain). Ital. brut- 
to is from brutu ‘heavy, dull’, and Rum. urit is a derivative of horrere ‘to 
bristle, look frightful’. 

Weak'. Debile survives in learned and some dialect words. Flebile ‘lamentable, tear¬ 

ful’ is more general, with dissimilation of the first l in some languages: French 
faible. Span. CaS. feble (hux. Ital.fievole, R-R (Engadine)_^a/ve/); and sometimes 
apparently crossed with fragile or gracile. However, fluxu ‘lax’ gives us 
more popular Span.^q/o. Cat. fiuix. Flaccu ‘flabby’ gives Ital.^acco and Old 
French flac (from the seventeenth-century,^axqMe), as well as the Latinisms 
S^an.flaco, Vtg.fraco. Rum. slab is of Slavonic, and §ubred of unknown, origin. 

7.7 CONSERVATISM IN ROMANCE 

The conservative tendencies of one or other of the Romance languages 
have often been mentioned in the course of our exposition. Some would 
go further and talk about ‘conservative languages’, though what constitutes 
conservatism is not always specified. Sard is sometimes cited as ultracon¬ 
servative, largely on the basis of a few features like the non-palatalization 
of velar consonants by following front vowels, the retention of final Latin 
-s, the failure to merge Latin short I and 0 with long fi and o, and the 
derivation of the definite article from ipse rather than ille. Yet Sard has 
undergone radical phonological transformations since the Latin era, 
including the opening and even loss of consonants in intervocalic sentence 
context and the change of lateral geminates [11] to retroflex plosive [c],]. 
Those who maintain that the central Sardinian varieties are archaic in 
character argue that the island was early isolated from innovations in the 
later Latin period. But there is no clear evidence that the centre of the 
island was thoroughly Romanized in classical times. The putative conserv¬ 
ative character may therefore be a reflection of a later introduction of an 
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artificial learned version of Latin rather than a natural evolution of an 
inherited spoken Latin. 

Portuguese is another so-called conservative language, allegedly owing 
to its geographical distance from the innovatory centres. It resisted innova¬ 
tive tendencies like diphthongization of tonic vowels, and the replacement 
of the simple past by a compound form, and preserved, however marginal¬ 
ly, verb forms, like the Latin pluperfect and future perfect indicative. Yet at 
the phonological level Portuguese is far removed from the Romance 
stereotype, with vowel nasalization, reduction of unstressed vowels and 
loss of intervocalic L and N. An interesting example of the ambiguous 
position of Portuguese on the ‘conservation’ scale is its unique use of old 
Christian terms for days of the week, where other languages perpetuate 
pagan terms: e.g. for ‘Monday’, segunda feira < secunda feria ‘second 
festival (after Sunday “Lord’s day”)’ rather than dies lunae ‘moon’s day’. 

Rumanian cannot qualify as a conservative language either, even though 
it is argued that its early isolation from the Western Latin world left it out 
on a limb. Its nominal case system and its ‘neuter’ gender seem, however, 
to be innovations rather than relics of the Latin equivalents. Its vocabulary 
is the least Latin-based of all the languages, and phonological and syntac¬ 
tic innovations are numerous. 

Each Romance variety has apparently random conservative features. 
Even in French, probably in modern times the most innovative of all, the 
old language retained traces of the Latin case system. We may wish to des¬ 
ignate as ‘conservative’ those varieties that, although they have undergone 
phonological transformations, have not radically refashioned their mor¬ 
phological systems by analogical processes, and have kept intact, by 
default, relics inherited from Latin. This would be most true of isolated 
local varieties, in southern Italy or in the Alpine region, for instance. The 
term ‘conservative’, on the other hand, may equally be applied to those 
varieties that have looked to Latin as their model, or have clung to tradi¬ 
tions established in a prestigious past. This would embrace the standard 
languages, especially Italian. 

That the languages should change at different rates and in divergent 
directions seems inevitable. How far these differences can be related to 
social factors is something we shall consider in the final chapter. 
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8.1 SOCIAL ASPECTS OF ROMANCE 

In nearly all parts of the Romance world the most salient social feature is 
regional variation. The social perception of such variation differs in dif¬ 
ferent areas, but speakers will almost always be more conscious of the 
geographical than of the social significance of language differences. 
However, educational status will also figure large in their evaluation of 
variants: on the whole there is still respect for traditional educational 
qualifications and for the types of language use these encourage. 
Moreover, there is often conflict between regional language varieties, 
which are identified with loyalty to a local community. Fear of loss of 
common cultural values, symbolized by language, has often focused on 
the need to preserve dying varieties and to discourage language shift. 
Support has thus been afforded to the dialectological studies and to the 
moves towards standardization of local varieties discussed in chapter 5. 
Much of this chapter will therefore be concerned with the interaction 
between different Romance varieties, as well as with neighbouring non- 
Romance languages. 

8.2 SOCIOLINGUISTIC VARIATION 

The sort of questions that preoccupy English-speaking sociolinguists are 
comparatively rarely discussed within Romance. It is mainly in those 
regions where geographical variation is limited that factors such as socio¬ 
economic status, age and sex loom large in variation studies. On the other 
hand, register and discourse-type differences are frequently studied, par¬ 
ticularly with reference to lexical and stylistic variation. In general it is 
probably true that Romance sociolinguistic thinking is more firmly rooted 
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in the conflictual tradition of Marx than in the consensus view associated 
with American theorists like Parsons. 

American-style sociolinguistics (in the methodology associated with 
William Labov) flourishes most, within the Romance sphere, in Canada, 
where the use of extensive spoken language corpora is exploited, using the 
statistical techniques developed especially by David Sankoff in Montreal. 
Here so-called ‘popular’ features of French, some of which are often com¬ 
paratively obsolescent in France, can be correlated clearly with socio¬ 
economic and educational status, and more ambiguously with age and sex. 
As a methodological tool for investigation the Canadian sociolinguists 
substituted for the usual sociological measures the concept of ‘linguistic 
market’, which matches language use with the social and professional 
requirement of speakers to employ more or less prestigious variants. 
Those employed in service industries and liberal professions, who require 
to exercise ‘communicational’ or ‘interpersonal’ skills, would be expected 
to make more use of standard linguistic forms than others, while the 
young (especially males), the very old and the ideologically disaffected 
would prefer nonstandard forms, particularly in informal interactions. 

A conclusion that may be tentatively drawn, from observation of non- 
prestigious variants in Canada, is that the dead hand of authoritarian 
purism has had less far-reaching effects there than in France. A possible 
explanation is that Canadian French usage to some degree has distanced 
itself from that of the Hexagon, even more than might be expected from 
the breaking of political links in the mid-eighteenth century. This certainly 
was so during the nineteenth century, especially in Republican periods and 
when France loosened its ties with the Church. Another explanation is 
that Canadian French had early reduced dialect diversity (by ‘colonial lev¬ 
elling’) and had established some sort of regional identity, which was but¬ 
tressed by a common resistance to the incursion of English. Although 
there are remnants of a localized ‘Acadian’ French on the east coast, sur¬ 
viving or restored after the deportation of the Francophone population by 
the British in 1755, there seems to be rather little in the way of geographi¬ 
cal variation in Canadian French, which in some respects appears archaic, 
even though there have been salient phonetic changes and extensive lexical 
borrowing from English. 

We should here note that sociolinguistic investigation of the Labovian type is 
exemplified also in other Romance-speaking American countries. In Argentina 
and Brazil, but especially among ‘Chicano’ populations in the United States, 
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there have been studies in this tradition. However, as in most parts of the 
Romance-speaking world, the question of interaction between different lan¬ 
guages receives more attention than that of social variation within one and the 
same language. 

In France, more recently than in Canada, there has been reduction of geo¬ 
graphical linguistic diversity. This was achieved very largely by deliberate 
institutional discouragement of dialect use, from the time of the 
Revolution, but especially since the establishment of universal education 
(see chapter 5). Another important factor may have been the sluggish 
demographic growth in modern France, and transfer of population from 
countryside to town, as well as the sizeable immigration into France from 
other countries. 

One unexpected effect of the imposition of the standard language on all 
Frenchmen has been the apparent development of a nationwide ‘popular’ 
variety that shadows the official language. ‘Popular French’ acquired 
covert prestige, especially in the first half of the twentieth century, as the 
idiom of the working class and of the young. The fact that some of the 
features of this ‘popular’ variety (like non-use of negative ne, of interroga¬ 
tive inversion, of the synthetic future or of prescribed auxiliary selection 
in the compound perfect) are also found in Canadian French suggests 
that they originate in older (seventeenth-century?) unprestigious usage, 
that remained underground, suppressed in the textual evidence by the 
weight of prescriptive authority. 

Already in the nineteenth century the ‘popular’ language, associated 
first with the lower social strata of Paris, was described with affection and 
amusement: it was usually linked to underworld argot that had been 
known from the fifteenth century. It began to penetrate into literary texts 
especially during the First World War, mainly in reported conversational 
exchanges between poilus (non-commissioned ranks). Between the wars it 
acquired political significance and was seen by some linguists as foreshad¬ 
owing changes in the French of the future, if oppressive authority could be 
prevented from blocking the creativity of the people. The ‘language of 
revolt’ was naturally vehemently condemned by traditionalists, who saw in 
it a threat to national culture, just as, later, traditionalists regarded the 
incursion of English loanwords. Since the Second World War, the ‘popu¬ 
lar’ idiom has penetrated to more and more discourse types, and today, in 
some form or other, it is part of the linguistic repertoire of nearly all 
French speakers. 
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Sociolinguistic investigation in France is mainly concerned with use and 
distribution of ‘popular’ variants: as elsewhere, it is found that they are 
more current in disadvantaged, economically and educationally, sections 
of society and that they are favoured most by young males and least by 
elderly middle-class females. As elsewhere, among young people prefer¬ 
ence for nonstandard usage is so great that they are often seen as having 
their own secret language: language distortions like verlan ‘back slang’ are 
particularly favoured, as are swiftly changing fashionable lexical items. 
However, the idea that there is a ‘popular language’ distinct from French is 
challenged by linguists, who point out that deviant usage occurs in only a 
smallish proportion of any stretch of discourse. 

The almost obsessive preoccupation of the French with their language 
has often been noted. This dates from the sixteenth century, when efforts 
for codification began, fortified in the seventeenth and eighteenth centuries 
with the insistence on the social and cultural importance of language 
standardization, and at the Revolution, which stressed the necessity for 
unifying the national language. The education system has always placed 
emphasis on the value of clear, elegant, concise and fluent expression, and 
on grammatical correctness and lexical precision. Newspapers continue to 
feature regular columns on language use and grammatical subtleties, and 
high-profile television programmes cover language matters alongside other 
cultural topics. The flip-side of this interest in language is the phenomenon 
of insecurity that besets those who feel disadvantaged linguistically, and so 
are reluctant to display their incompetence in public. Studies suggest that 
linguistic insecurity may be more prevalent in northern industrial areas, 
where local dialects are virtually dead, than in the south, where speakers 
may retain more sense of their regional linguistic identity, even though 
their Occitan dialects may be moribund. 

8.3 LANGUAGE CONFLICT 

The idea of language conflict was first fully developed within the Romance 
sphere, by Catalan activists protesting at the subjugation of their language 
to Castilian. This was a political issue especially during the Franco era, 
when language nationalism was identified with separatism and seen as a 
threat to the unity of the state, so that political moves were made to sup¬ 
press the regional languages. Mussolini adopted a similar policy, but car¬ 
ried it out less thoroughly. 
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The belief, however, that language is a badge of cultural and ethnic 
identity dates further back and is associated with revolutionary ideas and 
with Romanticism. It was during the nineteenth century that minority lan¬ 
guages within Romance sought to assert their worth and seek cultural, if 
not political, autonomy. The spread of standard languages, identified as 
the idiom of the state, began earlier, as we saw in chapter 5. From the six¬ 
teenth century already there were those, especially among the merchant 
classes, who perceived advantages in educating their children in a vernacu¬ 
lar language that had wider currency than the local dialect. The decline of 
Latin as the language of transnational communication encouraged the 
adoption of codified and unified standards within each Romance¬ 
speaking country. The growth of literacy was bound to level out linguistic 
variation, at least in the written language. 

This movement, however, had comparatively little effect on rural 
populations before the mid-nineteenth century, even in France, where 
ideologically the link between language and nationhood was most 
stressed. Educational surveys towards the end of the century suggested 
that a large proportion of French citizens had the perception that they were 
not speakers of French, but were conscious of having as their mother- 
tongue ‘another language’ - whether a Romance, Celtic or Germanic 
variety. One of the prime aims of universal education was to suppress 
these other languages, reducing them to subservient status and eventually 
eliminating them, so as to strengthen a feeling of unity, with participation 
in and loyalty to the French nation-state. Where the speakers were not so 
conscious of speaking ‘another language’ - that is, when they were users of 
Northern French nonstandard varieties - they were subject to similar pres¬ 
sures to ‘purify’ their own usage, by scorning local speech habits. 

On the whole, the drive towards unification of French succeeded; 
though, as we have seen, it may have bred a ‘popular’ sub-culture, and a 
degree of linguistic insecurity among those who learned to distrust their 
own language competence. Although not completely eliminated, regional 
varieties have declined, leaving often only traces that are socially stigma¬ 
tized. In the northern Picard old industrial areas, for instance, chtimi fea¬ 
tures, relics of older dialects, persist mainly in the speech of very old 
lower-class speakers (imitated sometimes especially by young dissident 
males), and in a few localities, usually in close contact with the Flemish 
that has lost ground steadily since the sixteenth century. In Belgium, 
dialect usage survived rather longer, especially in the eastern Walloon area 
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wedged between German- and Flemish-speaking regions, but looks almost 
certain to die out in the near future. Similarly, FrancoprovenQal varieties 
have virtually perished in eastern France and in Switzerland, barely surviv¬ 
ing in Alpine regions among the older generation. 

Catalan, in Roussillon, and the Germanic dialect of Alsace persist 
better, probably because of the support they receive from neighbouring 
varieties across state frontiers (in Spain and Germany or Switzerland), but 
in the towns especially there is a growing proportion of monolingual 
French speakers, and middle-class speakers often discourage use of the 
language by their children, on the grounds that it may hamper their social 
and economic prospects. 

In these latter regions, however, as in Brittany, Corsica and the Occitan 
areas, there is support for conservation of their own language. In Alsace 
and Roussillon, official encouragement usually takes the form of provid¬ 
ing teaching in related standard languages - German and Catalan - rather 
than in developing local varieties, just as in Corsica some advocate the use 
of standard Italian, and in Galicia of standard Portuguese. For Breton 
and Occitan, on the other hand, where there is no obvious extraterritorial 
model to follow, standard languages have been elaborated, that purported¬ 
ly serve as an umbrella for local varieties (see chapter 5). How far the 
decline of these languages can be halted, however, is open to doubt. 

It is the fear of decline and death of local languages, usually via shift to 
a dominant state language, that prompts language activism, sometimes 
even with resort to violence (here the most obvious example today is of 
Basque militants). The phenomenon is not, of course, confined to the 
Romance world, and often the motivation is not primarily linguistic, 
except in so far as the language is perceived as the emblem of ethnicity and 
subjugation of the language is a sign of withdrawal of political and eco¬ 
nomic power from the community that uses it. 

Language death is often an act of suicide rather than murder, as speak¬ 
ers (usually through their children) seek to desert a language that seems to 
offer no prospects of advancement. Usually in these circumstances bilin¬ 
gualism is perceived as a hindrance to prestigious use of the dominant lan¬ 
guage and there is resentment of imposed asymmetrical bilingualism, 
when the mother-tongue speakers of the dominant language are under no 
obligation to master the minority language. 

One feature of Romance language conflict which is rather unusual is 
that in some cases the languages in competition are so similar to each 
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other that speakers may not be fully aware in every case which language 
they are using. Here, as elsewhere, the social perception of language versus 
dialect status is all-important. I shall examine some questions relating to 
Romance-Romance bilingualism in the next section. 

8.4 ROMANCE-ROMANCE BILINGUALISM AND 
DIGLOSSIA 

The German term Binnendiglossie has been coined to describe use of two 
closely related languages in different social situations. The fifth volume of 
Posner & Green eds. (1980-93) fully covers this question for all Romance¬ 
speaking areas, so here I shall make only some general remarks about the 
contrast between France, Italy and Spain. In chapter 5 we saw that in 
France non-standard Romance varieties have largely given way to the 
national language: only in the Occitan region has this been a contentious 
issue. Although southerners often express general support for the revival 
of their language, it is noteworthy that champions are frequently not 
habitual users of Occitan varieties, while mother-tongue speakers, mainly 
in rural areas, tend to regard them as unprestigious and suitable for only 
intimate discourse. All speakers are bilingual and usage is diglossic in the 
classic senses of this term. Interpenetration of French and Occitan, pro¬ 
ducing a shifting ‘francitan’ interlect, is often seen as more a mark of 
southern identity than the codified standard Occitan. Frenchification of 
Occitan has been particularly evident from the sixteenth century, and 
Occitan (especially Gascon) use of French has long provoked amusement, 
tinged with condescending affection, on the part of northerners. 

In Italy most of the population operates with at least two varieties, 
which are broadly labelled dialect and standard. At present about 65 per 
cent of the population claim to have Italian as their first language: this 
proportion has increased massively since the Second World War. Most 
Italians have little perception of regional varieties as lacking prestige. It is 
said that in the interwar years esteem for relatively affluent Italo- 
Americans may have helped to bolster speakers’ favourable assessment of 
dialect: most immigrants to the United States were monolingual dialect 
speakers who had experienced no social pressure to shift to standard 
Italian. 

The linguistic situation in Italy is, however, much more complex than 
often realized, not least because of internal migration in modern times. 
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coupled with conspicuous territorial diversity. A commonly held simplifi¬ 
cation maintains that in the Industrial Triangle of the North (Turin- 
Genoa-Milan), koineization is general, whereas further south a mosaic of 
dialects persists. Certainly in Veneto, Lombardy and Piedmont, there have 
for centuries been written koineized ‘dialects’ (at earlier periods, alongside 
French), and these continue to be used (though more vigorously in the east 
than the west). On the other hand, in northerly Emilia Romagna each 
locality has its own idiom which is difficult to classify in terms of ‘dialect’, 
while Campania in the south has some degree of dialect standardization. 
In the industrial north, speakers often operate with a variety of ‘dialect’ 
forms that are more or less localized or koineized, as well as with a ‘stan¬ 
dard’ that is usually heavily penetrated with regional features. Whereas 
middle-class speakers may claim lack of proficiency in the more dialectal 
forms, this does not seem to be true of the majority of speakers, who are 
often proud of their own regional varieties (while mocking, for instance, 
Roman, Sicilian or Tuscan usage). 

Traditionally in Italy local nobility, as well as commoners, were said to 
use dialect, whereas ‘rootless’ Italian was favoured by the merchant class¬ 
es, and in the Papal States. Note too that until recently university courses, 
in Padua for instance, could be delivered in dialect, and that today dialect 
features, like Florentine gorgia toscana, or Roman consonantal lenition, 
have acquired ‘street credibility’ in many quarters. 

Dilalia has been suggested as a term to replace diglossia with reference 
to the Italian linguistic scene, which is characterized by overlapping of 
High and Low varieties, both being used in ordinary conversation, and by 
social stratification within the Low variety itself. The unifying thread of 
complex Italian usage seems to be the relationship of local varieties to the 
standard, with which they have interacted for centuries. The relationship 
has rarely been conflictual, though in the early modern period purism 
reigned with regard to the written code. Today most Italians, leaving aside 
the residue of Tuscan purists, are fairly relaxed about variation in the lan¬ 
guage, and it is usually claimed that there is no nationwide spoken stan¬ 
dard. 

One should mention, however, that propaganda of the northern Leagues 
proclaims the superiority of Northern culture and languages and thus 
antagonizes those who fear intolerance and extremism. It is an undoubted 
fact that economic prosperity differs widely between the Italian regions. 
Immigrants from the south, and from Sardinia, who seek work in the 
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industrial north are clearly disadvantaged linguistically as well as socially, 
in having to adjust to a new complex dialectal situation. It is to be noted 
too that Swiss Italian speakers, while retaining affection for their parish- 
pump Tessin dialects, tend to distance themselves from nearby regional 
koines, and espouse more purist standard usage for serious discourse. 

The position of Sardinia is more controversial, as some varieties there 
are distinct enough from the run of Italian dialects to merit classification 
as a different language. Here we should mention that some theorists 
believe that Sard falls together with Sicilian dialects, with which it shares 
some linguistic features as well as social problems, and that there is some 
resentment about the protection given to Sard as a ‘minority language’, 
while ‘Italian dialects’ are left to fend for themselves. 

Sardinia has had a particularly chequered history and has always suf¬ 
fered from grave disadvantages, including disproportionate depopulation 
and emigration. At present, as an autonomous region, the island is 
attempting to counter the effects of dialectal diversity and linguistic self- 
deprecation, but there seems little prospect that linguistic conflict, and the 
related questions of communal identity, can easily be resolved. 

Other separate Romance linguistic groups within Italy that should be 
mentioned are the Rhaeto-Romance speaking communities of the 
Dolomites and Friuli. The former area became (in the main) an 
autonomous region (Alto Adige / Sud Tirol), following a period of terror¬ 
ist activity by local German-speaking activists. The Ladin speakers there 
have always identified socially more closely with their German-speaking 
neighbours than with Italy, to which the region was joined after the dis¬ 
memberment of the Austro-Hungarian Empire. Although there are few 
moves towards unification of the disparate Rhaeto-Romance varieties of 
this area, they seem at present to be under no threat of extinction. The 
region now prospers, as a tourist playground, and linguistic conflict 
appears to be slight, although most Ladin speakers must be bi- or even 
trilingual (with German as well as Italian). 

In Friuli the linguistic situation is less happy. The autonomy of the 
region and of the language is not recognized by the Italian state, even 
though Friulian has had a literary form for several centuries. Most 
Friulian speakers are bilingual or even trilingual (using Venetan as well as 
Italian), and although there is considerable language loyalty, there are 
indications that more privileged classes discourage their children from use 
of the language, as lacking social prestige. 
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In Spain, the question of linguistic conflict takes on a very different 
complexion. The comparatively early unification of the country, with 
reconquest from the Moors and resettlement of reconquered areas by 
newcomers, reduced linguistic diversity within the country. But more or 
less homogeneous dialect groupings that did persist in the north came into 
conflict with central government especially during the Franco era. In 1978, 
however, was instituted a new regional policy, allowing a degree of linguis¬ 
tic and political autonomy to the regions, though decreeing that Castilian 
should remain the national language. Catalan and, less positively, Galician 
profited from the new legislation to establish themselves as regional lan¬ 
guages, distinct from Castilian. In other regions the separate identity of 
the regional idioms is disputed and there is wrangling about their status. In 
Aragon, for instance, the incursion of Castilian into the local language 
dates far baek and some see the regional language as no more than a vari¬ 
ant of the national one. In Asturias, local dialects clearly differ from 
Castilian, but also among themselves, and are referred to by the deprecia- 
tive plural babies. Passions are aroused in rows about whether a standard¬ 
ized form should be elaborated as an umbrella for the local varieties. 
Further south, there is discord about whether, for instance, Valencian is 
merely a variant of Catalan or a language in its own right, or whether 
Murcian or Andalusian are anything other than regional Castilian. In all 
these quarrels, myths about separate ancestry play more part than linguistic 
classification. A discrete ‘language’, that can be promoted educationally 
and culturally, is an object of veneration, while ‘dialect’ implies subordina¬ 
tion and decline, so that language figures high on the agenda of political 
activity in Spain, much more than in Italy. 

8.5 ROMANCE INTERLECTS 

Whereas, on the whole, Italians accept the consequences of interaction of 
dialect and standard as inevitable, purist attitudes, seeking to dictate lan¬ 
guage prototypes clearly differentiated from neighbouring varieties, are 
more widespread among Romance speakers. As we have seen, language legit¬ 
imacy is often regarded as established by unilateral historical descent (though 
never from further back than Roman times). So more archaic features are 
seen as more authentic than modem ‘bastard’ usage. Similarly a narrowly 
locally limited dialect, especially in the mouths of those who have never left 
the village, may be more highly valued than a regional dialect koine. 
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As we have seen, though, linguists assume that, before standardization, 
there was catenate mutual intelligibility between Romance local varieties, 
which did not fit smoothly into discrete classificatory slots. Today a terri¬ 
torial linguistic continuum has been replaced largely by polarization of 
dialects towards regional centres. In the northern Iberian peninsula, how¬ 
ever (also, but less so, in south-west France), it is still hard to draw a line 
between different dialect types. This is particularly striking when the 
dialect continuum bestrides a state frontier, as between Spain and 
Portugal, or between French Roussillon and Spanish Catalonia. 

Here we should mention the well-attested existence of some more or less 
stable Romance interlects that have grown up in Latin America. One is the 
fronterizo of northern Uruguay, which lies somewhere between Spanish 
and Brazilian Portuguese, and the other is cocoliche of the River Plate 
region of Argentina, which originated with massive Italian immigration 
into this Spanish-speaking area. In both cases, the interpenetration of the 
two related languages is so intense that speakers cannot reliably distin¬ 
guish them. Both varieties are regarded as socially unprestigious and are 
condemned by educational authorities. Yet they persist and appear to have 
emblematic value for the communities that use them. Descriptions of these 
varieties and their fluctuations appear in volume v of Posner & Green eds. 
(1980-93). It is suspected that similar, but less well-known, interlects sur¬ 
vive in other Latin-American areas, where there has been close contact 
between different Romance varieties. 

8.6 CONTACT WITH OTHER LANGUAGES 

Within Europe, there seems today to be resistance to influence on 
Romance varieties from neighbouring non-Romance languages. All the 
languages, of course, make extensive borrowing of lexical items from 
English, especially in the ‘Coca-Cola culture’ and the ‘computer-speak’ 
sort of sphere. But in areas like Belgium and Switzerland, where Walloon 
or Francoproven^al and French are spoken hard up against Germanic lan¬ 
guages, and where one assumes contact changes were frequent in earlier 
times, the Romance speakers today make little adjustment to their neigh¬ 
bours and linguistic frontiers have remained stable and virtually imperme¬ 
able for at least a century. In Alsace, French has penetrated the Germanic 
dialect, rather than vice versa, so much so that the dialects spoken on each 
side of the Rhine have grown quite far apart. 
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In the Swiss Orisons, where the Rhaeto-Romance dialects are receding 
before Swiss German, the Germanic influence is much more conspicuous, 
affecting especially the more westerly and northerly varieties. Isolates like 
Istro-Rumanian in former Yugoslavia, which only marginally survive, are 
even more open to contact influence. Istro-Rumanian has incorporated 
into its grammar even totally alien Slavonic features like aspectual verb 
distinctions. I have suggested (2.7) that the ‘Romanceness’, or more proba¬ 
bly the very survival, of such varieties is under threat. 

Although Rumanian today does not appear very much subject to influ¬ 
ence from neighbouring languages, its vocabulary, as we have seen, retains 
many earlier Slavonic, Magyar and Greek borrowings. Some features that 
distinguish it from other Romance languages are seen as ‘Balkan’, possibly 
deriving from a common substratum, as they are shared to some extent by 
other languages of the region, whether Slavonic, Greek or Albanian. 
Postposition of the definite article and reduction of use of the infinitive 
are among the features that have been explained in these terms. 

Perhaps we should mention here too the contact of Southern Romance 
with Romany. This is greatest in Rumanian, where the Gipsy population is 
numerous (2-3 million). The Ceaugescu regime forbade nomadism, so 
Gipsies were settled, at least temporarily, within more stable communities, 
though many of them have since migrated further west. How far they 
retained their own language is not properly recorded, but it seems that 
many Gipsy words have penetrated into informal, slangy Rumanian. 
Resentment of the Gipsies seems acute in present-day Rumania: just 
recently there have been attempts to stop them being called Romali rather 
than the official name J/gani, as it is feared that westerners equate Gipsy 
nomads with ‘more respectable’ Rumanians. Gipsy loanwords seem to 
have penetrated also into Spanish and Portuguese slang usage. Little is 
known, however, about what precisely the linguistic status of the Gipsy 
population is at the present time. 

8.7 ROMANCE OVERSEAS 

Romance languages are spoken all over the world, and it is not surprising 
that there are differences between them in far-flung regions. We have 
already discussed (5.7) the question of whether a distinction can be made 
between creole and overseas dialect. On the whole, where a language 
spread to an overseas territory as late as the nineteenth century, or when 
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there remained contact and exchange of population with the metropolis, 
the overseas variety has not deviated greatly from the home standard. We 
can contrast here, for instance, Portuguese creoles on the Guinea coast 
and in India with the more standard forms of Mozambique or Angola. In 
creole-speaking areas where close contact with the metropolis remains, as 
in French DOMs, the sociolinguistic relationship between creole and stan¬ 
dard has more the character of diglossia than of a creole continuum. In 
Brazil, on the other hand, more creolized forms are giving way to a stan¬ 
dard that is based on local usage, rather than that of the European stan¬ 
dard. 

Latin-American usage has never been totally divorced from that of the 
Iberian peninsula, in spite of the independence struggles of the nineteenth 
century. Immigration by Europeans, especially but not exclusively from 
Iberia, has continued to the present day. Diversity within the vast 
American territory remains immense but interintelligibility has not nor¬ 
mally been lost. It has often been suggested that conservative usage corre¬ 
lates closely with climate, assuming that European settlers were more at 
home in more temperate areas, but this does not adequately fit the facts. 
Rather, it would seem that coastal areas, with more contact with the out¬ 
side world and which were often permanently settled rather late, display 
more linguistic homogeneity than inland districts, where Spanish colonial 
administrative centres tended to be established from the mid-sixteenth 
century. 

Latin-American linguistic features have been mentioned throughout 
this book, so we shall not rehearse them again. On the whole, it can be 
claimed that many American features reflect sixteenth-century peninsular 
usage: for Spanish it is usually assumed that southern characteristics loom 
large and that the Canary Islands served as a funnel through which most 
of these reached America. More controversial is how far later divergences 
stem from the influence of indigenous languages. Clearly lexical borrow¬ 
ings, especially relating to locally produced items, were numerous; but pen¬ 
etration of Amerindian Spanish sociolects into the phonological and 
grammatical norm was more questionable, in the social climate that pre¬ 
vailed during most of the history of Latin America. In some parts, as in 
Paraguay, though, mixed marital unions were frequent and bilingual 
speech habits seem to have profoundly influenced the Spanish norm. In 
more modern times, popular political movements, as in Cuba, Bolivia or 
Nicaragua, have facilitated the social acceptance of lower-class speech pat- 
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terns. It is not easy to point precisely to ways in which Latin American 
usage has clearly been influenced by indigenous languages or by the input 
of African slaves, especially as nearly always a parallel can be drawn with 
some European variety, but John Lipski has sought to link, for instance, 
the frequency of clitic doubling in American Spanish with such influence. 

American French, on the other hand, shows little influence from 
Amerindian (for creoles see 5.7). The link with popular regional varieties 
of the seventeenth century is clear, though, as we have seen, ‘colonial level¬ 
ling’ tended to iron out dialectal differences early. What is in question, 
though, is how far later development was influenced by contact with 
English. Although lexical borrowing is very frequent, other influences are 
not as incontrovertible as sometimes assumed: for instance, lowering of 
vowels in blocked syllables seems to be carrying to its logical conclusion a 
process initiated in early modern metropolitan French. In some cases, 
however, it is hard to deny that English must have made a mark: for 
instance. Prince Edward Island French uses preposition-stranding in a way 
totally unknown to other French varieties. 

It is not possible here to survey all forms of ‘emigrant’ Romance, but 
Judeo-Spanish calls for particular comment. For most it is clearly basically 
a version of the language used in Spain in 1492 at the time of the expul¬ 
sion of the Sephardic Jews. Some maintain, however, that Judeo-Romance 
had retained distinctive characteristics from Roman times, when Jews 
formed an important element in the Imperial population and spread to 
different parts of the Empire. Spanish, French and Italian documents 
written in Hebrew script are said to show similar features, but so far they 
have been inadequately studied. 

The expelled Jews dispersed to the Maghreb, Italy and especially to the 
Ottoman Empire, where their services were much appreciated, at least 
until the seventeenth century. Undoubtedly Spain suffered a great loss 
from the emigration of skilled personnel: curiously Queen Isabella’s own 
medical adviser was expelled, and the carpet which she presented to him 
on his departure still adorns the tiny synagogue in Dubrovnik (Ragusa). 
Some Jews converted {marranos), but not infrequently took an early 
opportunity to flee elsewhere - for instance, to Holland or Brazil. Their lan¬ 
guage survives precariously today, mostly in Israel and various African 
and American locations, although many speakers perished in the 
Holocaust, and others have been assimilated linguistically. 

The Judeo-Spanish known as ladino was originally only a written Ian- 
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guage, used in Spain mainly for word-for-word translation of religious 
texts, for the benefit of those whose Hebrew was not up to scratch. It 
therefore has Hebrew syntactic features. The spoken language of the 
Sephardics was, however, known as Judezmo, which began as a number of 
disparate archaic Spanish varieties (with, for instance, retention of initial 
[f] - see 6.3), spoken in scattered ghettos, where a Spanish life-style was not 
maintained. These varieties changed little, though naturally they took in 
loanwords, and there was some simplification of phonological and mor¬ 
phological structures which can be paralleled from other Spanish varieties 
(for instance, the -se past subjunctive was lost - see 7.5). From the mid¬ 
nineteenth century, with the educational endeavours of the Alliance 
Israelite Universelle, however, judezmo underwent koineization and sub¬ 
stantial influence from French, and it came sometimes to be known as ladi- 
no. It was also adopted by more recent German Jewish immigrants to the 
Balkans, as a mark of Jewish identity. In western Europe it has disap¬ 
peared, though Sephardic communities maintain prestige, exerting influ¬ 
ence, for instance, on the Israeli pronunciation of Biblical Hebrew. 

8.8 LANGUAGE ATTITUDES: IN PLACE OF A 
CONCLUSION 

As I come to the end of this book, in which I am conscious of being able 
only to skim the surface of the rich linguistic material proffered by the 
Romance languages, I shall try to draw together some of the rather tangled 
threads of my earlier arguments. I have tried to show that in spite of all the 
diversity within Romance, there is nevertheless a degree of unity. That this 
depends to a large extent on common perceptions and attitudes I have 
tried to convey by my use of the catch-phrase ‘Romance club’. Myths 
about identity are not usually founded on hard fact, but on construction of 
a historical record. For the Romance languages, the history is there for all 
to see: no-one denies that they occupy today much of the geographical 
space covered by the Roman Empire. Puzzles remain: like why Britain or 
the Maghreb are not still members of the Romance club, whereas 
Rumania is. What I have tried to emphasize is the fellow-feeling, based on 
the idea of continuity of tradition, which certainly survived in the Middle 
Ages and fuelled common enterprises like the Crusades. 

What is more remarkable, perhaps, is that the unity of Romance was not 
shattered by the renewal, the ‘bom-again’ jerk that came with the 
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Renaissance and the Reformation, followed by the expansion of the 
Romance domain far beyond Europe. There was, as I have tried to show, 
differentiation between the languages, as dialects polarized to form 
statewide standards. But there was also promotion of language attitudes 
that created a different, more conscious sense of unity, with a turn towards 
the linguistic and stylistic ideals that had characterized Latinitas. 
Romanceness is more a matter of cultural awareness than of ethnic or reli¬ 
gious brotherhood. In the Middle Ages the languages drifted apart with¬ 
out any sense of direction, at the whim of random universal impulses. 
Speakers did not wholly realize what was happening, because Medieval 
Latin still provided the adhesive which held the edifice together. 

In the early modern period, however, heralding an era of comparative 
linguistic stability, the languages were differentiated, but on fairly pre¬ 
dictable lines. No unscalable barriers were imposed on easy acquisition of 
another Romance language by any Romance speaker. A recent prize con¬ 
test in Bucharest asked competitors to execute translations between vari¬ 
ous Romance languages, including some in which they had no previous 
competence. This proved a feasible task, and competitors readily coined 
forms that would be recognized, by users of the other language, as con¬ 
ceivably acceptable, though not current in general use. An invented 
Romance interlanguage is not beyond the bounds of possibility, and 
indeed Esperanto to some degree represents such an interlanguage, as did 
the earlier lingua franca. 

Until recently, most educated Romance speakers would, in any case, be 
familiar with Latin and would have acquired as their first foreign language 
another Romance tongue. Now, as English becomes the first foreign lan¬ 
guage for more and more countries, and as Latin descends further into its 
grave, some sense of the ‘sameness’ of the Romance languages may get 
lost. The same effect of ‘distanciation’ could be achieved by further 
‘democratization’ of linguistic discourse: informal colloquial forms in each 
language are usually further removed from the common linguistic pool 
than are more stilted standard forms. 

Perhaps then the Romance languages will move further apart, and the 
Romance club will put up its shutters. Only time will tell. 

FURTHER READING 


Posner & Green eds. (1980-93) vols. ll, ill and v give full cover of these ques¬ 
tions. On Romance sociolinguistics in general, Dittmar & Schlieben-Lange eds. 
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(1982) and relevant sections of the triennial proceedings of the Societe de 
Linguistique Romane. See also Green & Ayres- Bennett (1990). On contact 
studies, Thomason & Kaufman (1988), Fisiak ed. (1995). On European 
language questions, Parry, Davies & Temple eds. (1994). On language shift, 
Fishman (1991), Ureland ed. (1982). On ‘Balkan linguistics’, Solta (1981). On 
Latin American Spanish, Sala et al. (1982), Lipski (1994); on Canadian 
French, Sanders ed. (1993); on Judeo-Spanish, Sephiha (1986). 
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Arabic 183,233,240,250-2 
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Bastia see Corsican 
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Belgian see French; Walloon, Gaumais 

Bellunese see Italian 
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Bivian see Romansh 
Bolivian see Spanish 
Bonifacio see Corsican 
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Bozal see Creole 
Brava see Creole 
Brazilian see Portuguese 
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Bulgarian 17,91,321 

Cajun see French 
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Canadian French see French 
Cape Verde see Creole 
Caribbean see Creole 
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200-1,205,208,214-15,216,226,229, 
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275-9,290-3,297-300, 303,306, 
312-13,316,319,321,323-6,332, 334, 
338,339 

Alghero, Algherese 191,300 
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OW191,311,319 

Roussillon 191,205,334, 339 
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standardise, 260,268, 306,311 
Valencian 191,215,311, 338 
Western 43 

Caviteno see Creole: Philippines 
Celtic 3,4,29,89,100, 234-5,237-8,239, 
244, 333 
Gaulish 238 

Chabacano see Creole: Philippines 
Chicano, Mexican-American see Spanish 
Chtimi see French 
Cocoliche see Spanish, Italian 
Colombia(n) see Spanish 
Corsica(n): 122,195,216,290, 334 
Bastia 260 

Creole: 219,221,224-5,341 
African (Portuguese) 35,73,78,220,341 
American 35 

Angolar (Portuguese), see Gulf of 
Guinea 

Antilles (French), see also Guadeloupe, 
Grenada, Martinique, St Lucia 73,75: 
St Barth 73,221; St Thomas 73,222 
Asian 35,74,77,220 
Batticoloa see Sri Lanka Portuguese 
Brava (Portuguese) , see Cape Verde 
Cape Verde, crioulo (Portuguese) 73,74, 
75,77,78,93,220 

Caribbean (French) see also Antilles 219, 
220-1 

Colombia see Palenquero 
Congo see Palenquero 
Curafao see Papiamento 
DJoe-tongo 90,93,95,225 
Forro see Principe 

French 72-3,75, 78,80,220-2,303, 341 
Grenada (French) 221 
Guadeloupe (French) 68,77,221-2 
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(Portuguese) 73,74,84,220,341 
Gulf of Guinea (Portuguese) 74,221; 
Angolar 88,90,95,225; Annobon 225; 
Principe 68,80,220; Sao Tome 68,73, 
74,77,78, 80,88,93,220,225 
Guyana (French)80 

Haiti (French) 77,78,80,91-2,220,223, 
225 

India (Portuguese) 341 
Indian Ocean (French) 46,68,72-4 
Kristang (Portuguese) 73,122 
Louisiana (French, see also Cajun) 48, 
73-5,91,95,219,223,225; Avoyelles 
223; Lafayette 223; St Martin 75,223; 
Vacherie {St James) 223 
Malacca (Portuguese) 73 
Martinique (French) 221 


Mauritius (French) 72,77, 80,85,91,222 
Monco see Sao Tome 
New World 68,220-1 
Palenquero (Spanish) 75,219 
Papiamento, papiamentu 74, 75, 85,90,93, 
219 

Philippines, Chabacano (Spanish) 74 
Port au Prince see Haiti 
Portuguese (Afro-Lusitanian, Lusocreole, 
Indo-Portuguese) 73,74,75,95,219 
Reunion (French) 74,222,225 
Sao Tome (Portuguese), see Gulf of 
Guinea 

Samaraccan see Djoe-iongo 
Senegal (French) 84 
Seychelles (French) 72, 84 
Spanish (Afro-Hispanic) 75,219 
Sri Lanka 74 
St Lucia (French) 85, 221 
Surinam see Djoe-tongo 
Trinidad (French) 221 
Croatian see Serbo-Croat 
Cuba see Spanish 

Dacian 245 

Daco-Rumanian see Rumanian 
Dalmatian 16,95,104,113,195,197,229, 
292 

Istrian Peninsula 196 
Ragusa 195 

Veglia 12,14,16, 81,91,113-14,159,196, 
229 

Djoe-tongo see Creole 

Dolomites see Rhaeto-Romance; Ladin 

Dutch {see also Flemish) 14,90,93 

East Romance 107,197,252 
Egyptian 323 

Engadine/Engadinish/Engadinianice 

Rhaeto-Romance 

English 36,79,81,90-1,95,104,145,182, 
189,214,221,223,238-9,290,314, 329, 
330,331,342 
Etruscan 202,235,241-2 

Fassa see Rhaeto-Romance: Ladin 
Flemish 333,334 
Florentine see Italian: Florence 
Francoproven?al 18,25,46,116,123,142, 
147,176,194,198-9,200,204,238, 

240,246,262, 334,339 
Alpine 118, 127, 176, 334; Saint Martin de 
/a Porte (Maurienne) 118; Valais 128; 
Val d'Aosta {vaidotain) 194; Vaud291 
Guardia Piedmontese {Calabria) 176 
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Jura Vermes 22 
OW240 

Frankish 105, 138-9,233 
French (Oil) 1,4-6,12-13,19-21,23-9,35, 
37-8,43,45-7,51, 53-8,60,67-70,75, 
77, 80-7,89-95,99-101,104-6,108-17, 
119-20,122-7,131-2,135,137-41, 

143 7,149-50,158-65,167-9,171-2, 
175-84,189-94,197-205,210-12, 
215-16,218,220-6,229-30,233-5, 
237^0,243-50,255,257,261,264-9, 
273-8,285-6,288-94,297-312,314-19, 
321-27,330-1, 333-5,339,342 
Acadia, see Canada 
Alps 194 

American (see also Canada, Louisiana) 
48-9,222,342; Missouri (Old Mines), 48 
Argot 331 

Belgian see Walloon 
Cajun 48,223-4 

Canada 25,27,84,175,189,223,309,330, 
331, Acadian 25, 330; Prince Edward 
Island 342 

Central 209, 287-8, 306 
Champagne 209 
chtimi (Picard) 333 
Creole see Creole 
Demuin (Picard) 46 
Dominica see Creole 
East see Francoproven^al 
England (Anglo-Norman) 256 
Gaumais (Lorraine) 

Guadeloupe see Creole 
Haiti see Creole 
He de France 210 
Louisiana see cajun. Creole 
Martinique see Creole 
Mauritius see Creole 
Medieval see Old 
Middle 17,256,286 
Missouri (Old Mines) see American 
Modern 45, 53,134,173^, 240,248, 
259-60,274,276,279,286-7,289, 307, 
310 

Monaco see Occitan 
Nice see Occitan 

Norman 209,221: Old (see also England, 
Anglo-Norman) 13 

Northern 113,134,147,166,179,204,206, 
209,211-12,232-3,238-9,246,247,333 
North-East 287-9 

Old 12,14,21,29, 36,45,47,53, 56, 57, 

59,69,92,100,108-10,112,114,116, 
123-5,127-8,139-40,142,147,158, 
162-3,173,175,230,240,246-7,248-9, 


252,256,267,276,279,285,287-90, 
292,307-9,313, 315,321,324,326 
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Picardy 46,48,68,204,209,286,333 
Popular 331, 332 
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Saintonge 246 
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South 133^, 204,206,215,246,256, 

304 

South-East 190,194 
South- West 50,339 

Standard 17,45,84, 89,137,206,224,259, 
262,304-5 

Swiss (Suisse Romande) 89,189,303 
Trinidad and Tobago see Creole 
verlan 332 

Walloon 13,89,105,189,204,333, 339; 

East 238; Liege 205; Old 204 
Western 117, 123,204, 246 
Forro see Creole: Principe 
Friulian, Friulan, Frioulian see Rhaeto- 
Romance: Friuli 

Fronterizo/Fronteirigo see Portuguese 

Gadera see Rhaeto-Romance: Ladin 
Galego-Portuguese 191 
Galician, Gallego 28,79,95,122,125,142, 
165-6,170,175,191,194,207,215,231, 
252,256,265,334,338 
Gallo-Italian see Italian 
Gallo-Roman (see also Latin) 197 
Galluran see Sard 

Gardena see Rhaeto- Romance: Ladin 
Gascon see Occitan 
Gaumais see French 
Genoese see Italian: Genoa 
German 4,5,14,15,17,20,79,95,112,118, 
192,193,206,214,233,238,240, 
247-50,273, 304,334,337,343 
Swiss 193, 340 

Germanic 8,14,36,63,81,92,98,102, 
104-5,117,139,157-8,167,200,206, 
211,232,233,238,245-50,256,291, 
303, 322,325,326,333-4, 339,340 
Greek 3,13,14,84,99,108,126,131-3, 
139-40,144,147,150,165,171,175, 
228,233,237-8,239,319, 324,325, 

340 

Byzantine 65 
Gypsy, Romany 340 
Grenadian see Creole: Grenada 
Guadeloupe see French, Creole 
Guineense see Creole: Guinea-Bissau 
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Haitian see Creole: Haiti 
Hebrew 251, 342, 343 
Biblical 343 
Hungarian see Magyar 

Iberian 133,143-4,149,166,191,199,208, 
242-3,245,290,312-13,314,321,322, 
323,339, 341 
Northern 251 
Western 132, 136-7,209 
Proto-Ibero-Romance 197 
Indian Ocean see Creole 
Indo-European 36,75, 156, 179,185,200, 
234,299-300 
Istriot 196 

Istro-Rumanian see Rumanian 
Italian 3,6,12,14-15,19-21,23,38,40-5, 
47-50, 53,55-7,62-7,77-87,89-95, 
99-102,105,107,109-15,117,120-5, 
127-8,130-1,133,135,138-9,140-5, 
148-50,158-60,162-3,167-70,171, 
175-6,177,179-84,189-90,195,197-8, 
20(M, 212-13,216,229,234-7,240, 
245-6,250,256-7,259,261-2,264-72, 
275-9,286,290-4,297-301,303^, 
306-7, 309, 312,316-23,325-7,334-6 
Abruzzo 13-14,23-4,61,62,64, 77,79, 
101,119,160,198; Sulmona 62; Western 
61,64 
Apulia 198 
American 335 
Asolano 21 
Bari6\ 

Bologna see Romagna 
Bormio 24 
Calabria 18,83 
Campania 57, 336 
Carpignano 18 

Central 101,120,122,129,148,149,246, 
267,292 
Chieti 160 

Cocoliche (see also Spanish, Argentina) 
339 

Cori (see Latium) 

Corsica (see Corsican) 

Cremona 50, 257 

Emilia Romagna 13-14,17,81, 336 
Florence see Tuscany 
Gallo-Italian 217 
Gewou 18,48, 53,291,336 
Greek-speaking areas 233 
Imola 25 
Ischia 160 

Liguria\l-\%, SO, 190,213 
Latium (Lazio) 18; Cori 18 


La Spezia 198,229 
Livorno 241 

Lombardy 13-14,17, 52,192,291,336 
Lucania (Basilicata) 6, 61, 62, 83, 120, 
200, 297 
Lucca 241 

Marche (Marchigiano, Marsian) 
Servigliano, Ascoli Piceno 18, 62 
Medieval see Old 
Milan 17,24-5, 336 
Monaco (Monegasque) see Occitan 
Aapfe (Neapolitan) 18 
Nice (Niqois) see Occitan 
Northern 17,41,45-6,48, 51^, 77,79,88, 
104,120,122,132,137,149,178,192, 
201,206,213,217,230,232,236,238, 
241,244-5,247,256,257,267,277,291, 
299, 303-4,305,310, 336 
North-eastern 196,198 
Novara 18 

OW18,21,79,100,112,149,298,309, 
313-14, 325 
Padua 336 

Papal States 204,336 
Piedmont 17,24,297,336 
Pugliese see Apulia 
Rimini 198,229 
Romagna 17, 81 
Rome 213,236, 336 
Salento 64, 237 
San Marino 190 
San Tommaso 23 
Sardinia see Sard 

Sicily 18,24,64,91,122,133,183,237, 
256, 336, 337 
Siena 42 

South 18,41,48, 57,61,63-4,77,79, 88, 
101,105,112,120,122,131-2,134, 
136-7,149,164-5,197,217,229,233-4, 
236-7,243,246,286,292, 311 
South-Central (A, 120,165,236 
Standard!, 37,38,41,104,136,148, 

160,234,236,258-9,276,278,290, 

292, 303 

Swiss (Tessin, Ticino) 18,46,190,202, 337 
Trevi 120 
Tridentine 30 
Turin 303, 336 

Tuscany 17,24,41-2,49,64,66,77,124, 
195,202,212,216,234-5,241,256,261, 
268,272,290,325,336; North 235; Old 
50,311 
Umbria 120 

Veneto (Venetan) 18,21,41, 52,82,192, 
194,336, ^^l; Central 51 
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Venice (Venetian) 17,24, 52,53,88,195, 
196,198 

Italic 98,202,228 

Judeo-Romance 342 
Judeo-Spanish, Sefardi, Sephardic see 
Spanish 

Jugoslavia see Yugoslavia 

Ladino, Ladin see Spanish, Rhaeto- 
Romance, Dalmatian 
Languedoc see Occitan 
Latin 1-3,7,9,12,21,36-40,43,47,55-61, 
63,65,67,69-70,75,76,78,79,80, 
83-118,120-6,129-33,135,138^5, 
147-54,157-8,160-3,166-7,170,175, 
177-81,183-5,197-8,199,201-3,207, 
211,218,227-9,231-3,235,238^0, 
242-4,248,250-2,254-5,262,275-80, 
286-7,291-2,294-5,298-302,305-7, 
309-12,314,317-18,320-1,323-7,333 
Christian 126 
Church 104 
Ctoca/36,81,177 
Gallo-Roman 238-9 
Imperial 232 

Late99,102-3, 111, 113-14,129,135, 
144,154,158,177,179,229,231,235, 
314 

Medieval 104,153, 344 
Neo-Latin 104 
Popular 38,98 
Vulgar 4, 6, 98, 102-3, 197 
Western 327 

Latin American 41,83,136,169,319, 341-2 

Latium see Italian 

Leonese see Spanish 

Ligurian see Italian 

Limousin see Occitan 

lingua franca 181 

Livinallonga see Rhaeto-Romance: Ladin 
Logudorese, Logudorian see Sardinian 
Lombard see Italian: Lombardy 
Louisiana see Creole and French: Cajun 
Lucanian see Italian 

Macedo-Rumanian see Rumanian 
Madinka 84 

Magyar (Hungarian) 253,254,324,340 
Malaccan see Creole: Malacca 
Maltese 35 

Marches, Marsian see Italian: Marche 
Martinique see Creole 
Mauritian see Creole: Mauritius 
Megleno-Rumanian see Rumanian 


Mexican see Spanish 
Milanese see Italian: Milan 
Moldavian, Moldovan see Rumanian 
Monco see Creole: Sdo Tome 
Monegasque see Occitan: Monaco 
Mozarabic see Spanish 
Murcian see Spanish 

Neapolitan see Italian: Naples 
Ni?ois see Occitan: Nice 
Nidwaldisch see Rhaeto-Romance: 

Romansh 
Norse 239 

Nuorese-Bittese see Sard 

Oberhalbsteinisch see Rhaeto-Romance: 
Romansh 

Oberwaldisch see Rhaeto-Romance: 
Romansh 

Occitan, Oc 12-14,37,42-3,56-7,79,81-2, 
84,87,91,93,95,100,105-6,113-14, 
117,120,123,125,132,134,137-8,139, 
161,171,178,183,190,194,197-8,200, 
204,206,229-30,239^7,261,267,270, 
272,278-90,292,294,302-3,311,313, 
319,320,321,325,326,332, 334-5 
Aranesl Aranese see Val d’Aran, Vail 
d’Aran 

Bearnese 122,130,133,190,206,237 
Biarritz 205 
Foix 20 

Gascony 122,205-6,230,260,265,310, 
313,324,335 
Gevaudan 303 
Languedoc, Western 258 
Limousin 105 
Menton 205 

Monaco, Monegasque 190 

Nice, Nifois 190,213 

OW3,57,82,89,116,135,190-1,239 

Provence, Provenfal 3,43,105,216 

Pyrenees 190-1,205-4 

South-west 174 

Standard 216 

Val d'Aran 190 

Western 122,176 

Palenquero see Creole 
Paninarese see Italian 
Papiamento see Creole 
Parisian see French: Paris 
Persian 321 

Philippine Spanish see Creole 
Picard see French: Picardy 
Piedmontese see Italian: Piedmont 



364 


Index of languages 


Portuguese 6,12-16,25,27-9,35,40-2,47-9, 
56-9,67,77, 79-91,93-4,100,109-14, 
120, 124-5,127-8,131,134-6,138-9, 
140-5,148-50,160-1,165-6,169-70, 

171- 2,177-84,189,191,197-201, 
208-9,215,218,220,224,229,231, 
234-5,237,239,243-6,250-3,257-8, 
261-2,264,266,269-72,275,279,290, 
292, 294,297-304,306-7,309-13, 316, 
318,319,321,323-7,334, 339,340 

Angolar see Creole 
Asian 220 

Brazil 42, 49, 51,55,144, 166-7,170, 

172- 6,209-10,220,231,243,262,264, 
330,341,339; Rio 221 

Cape Verde see Creole 
Creole see Creole 

European 170,175-6,209,231,243^, 

262,264 

Fronteirifo I Fronterizo 339 
Guinea-Bissau see Creole 
Lisbon 42, 209 
Macau 220 
Malacca see Creole 
Mozambique 341 

OW82,122,170, 178,244,261,309 
Northern 28 
Sao Tome see Creole 
Sri Lanka 74, 220; Batticoloa 220 
Standard \4\, 170,252 
Principe see Creole 
Provencal see Occitan 
Puter see Rhaeto-Romance: Ladin 

Quebecois see French: Canada 
Quechuan see Amerindian 

Reunion see Creole 

Rhaeto-Romance 22,44, 57,117-18,122-3, 
192-3,194,200,260,277,290,303,310, 
324, 326, 337,340 

Engadine Engadinish/Engadinian/Ladin 
13-15,77,81, 86,88-9,91-2,104, 
113-14,122-3,128,131, 180,193, 
196-7,200-1,217,226,229,245-6,247, 
260,277,278-9,291,299-300,306-7, 
311, 321,322, 325,326; Inn Valley 193; 
Lower ( Vallader) 44, 193, 260; Remus 
44; Santa Maria 44; Mustair, Val 88, 
193; Upper (Puter) 44, \92, Zernez 44; 
Zuoz44 

FriM/i (Friulian, Friulan, Frioulian) 12-14, 
15,21,27,45,48, 50,78,79,82,130,133, 
169,193-4,197,200,218,261,302, 337 
Italy (see also Dolomites, Friuli) 117 


Ladin, Dolomites 15,45,82,87,104,200, 
337; Alto Adige 337; Ampezzo 45; Fassa 
(Val di) 200; Gardena 82, 87; Pddola 
(Comelico) 45 

Romansh Romontsch, Roumanche, 
Roumantsch-Grischun, Upper Rhine 
44.95, 105; Surmeiran Surmeirish, 
Oberhalbsteinisch, Unterhalbsteinisch 
44,63,127,193; Surselvan, Sursilvan 
Surselvisch, Oberwaldisch, Churwalsch 
17,21,23,44,54,55,77,79, 81,84,86, 
88,105,117 19,123,159,167,176,180, 
193,217,249,277,279,291, 303-4,325, 
326; Brigels 44; Old 64; Tavetsch 44; 
Sutselvan, Sutsilvan (Sutselvisch, 
Nidwaldisch) 193 

Switzerland (Graubunden, Grisons) (see 
also Romansh, Engadine) 36,63,88, 
117,125,127,131,180,192,216-17, 
239,278,309, 340 
Roman see Italian: Rome 
Romany see Gipsy 

Rumanian/Romanian 4,6,8, 12-15,29, 
36-8.43,48-9,57,63,65-7,69,77, 

79 -84,86-95,100-1,105,107, 109-15, 
120, 122-4,127,130,132-^, 137-44, 
147-8,150,157-9,161,164-6, 169, 
171-2,176, 179,181 3,185,189,194-5, 
197-201,207,213 -14,217-18,225, 
229-30,233-5, 243,245-7,250,252-4, 
256,261,263-4,266-7,275,277-92, 
297, 300,302, 306-7,309,311-12,314, 
316,317,319, 321-7 
Albania 211, 233 

Arumanian (Macedo-Rumanian) 89, 217, 
302,319 
Banat 171 
Bucharest 213 

Daco-Rumanian 16,28,88,91-2,122,139, 
171, 196,207,213,217-18, 319; Old 16 
Danube, South of 16, 89, 122, 165, 207, 
243,250,253,321 
Danube delta 194 

Istro-Rumanian 43,88-9,93,95,196,218, 
233, 340 

Macedo-Rumanian (Arumanian) 217, 233 
Megleno-Rumanian 92,217 
Mo Wav/an (Moldovan) 139, 169 
Old 3m 
Oltenian 137 

Standard see Daco-Rumanian 
Translyvania 169 

Sao Tomense see Creole: Sao Tome 
Sard(inia) 13,20,24,36,38,42, 57,79-81, 
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86-95,100,107,113-15,120,122-3, 
127-31,135,148,157-8,165,174-5, 
179,183,185,195,197,199,200-3,217, 
226,229-30,235,246,261,266,268, 
275-9,286,290-3,300, 302,306, 
309-11, 316, 317,323,324,326, 336-7 
Bitti 235 

Campidanese, Campidanian 13 14,25,42, 
85,195,236,292-3,309 
Gallurese, Galluran 195 
Logudorese, Logudorian 12-14,42,82,85, 
86,229,239,290, 309,322 
Northern 195 

Nuorese-Bittese (Nuoro) 81,84,85,128, 
174,235 

Proto-Sardinian 197 
Sarrabus 25 
Sassarese (Sassari) 195 
Sassarese see Sardinian 
Scottish 238 

Sefardi, Sephardic see Spanish 
Serbo-Croat 93,195 
Seychellois see Creole: Seychelles 
Sicilian see Italian: Sicily 
Slavonic 17,29,65,83,88-9,93,96, 123, 
171-2,181,213,243,252-4,256,290, 
321,324, 325,326, 340 
Slovenian 193 

Spanish 12-16, 35,40,42,47,55-9,63, 

66-7,77-85, 87,89-91,93-4,100, 

104,109-14,119-22,124-5,127-8, 

131,134-6,138-40,142-6,148-50, 

158- 60,162-3,165-6,168-70,171, 
175-80,182-^, 189,191,197-202, 
207-9,215,219,226,229-32,234-5, 
237,240,242-3,245-6,250-1,256-8, 
261-72,275,277-9,286-7,289,292-5, 
297-303,306-7, 309-19,321^, 326, 
335, 338-41 

Amazon, Peruvian 242 
American 48,50,78,208-9,218,226,230, 
252,262,310,341,342 
Andalusia(n), Andalucia 25,49,230, 

251-2, 323,338 
Andes 137 

Aragon(ese) 50,130,133,191,207,215, 
237,261,319,338 

Argentina River Plate, {see also Cocoliche) 
49,78, 322, 330, 339 
Asturias, babies 50,60-3,101,119-20, 

159- 60,207,215,338 
Bolivia 341 

Canary Islands 252, 341 
Castilian, Castile 122,133,139-40,148, 
198,208-9,215,242,251,256,258,295, 


310-12,319,332,338 
Old Castile, Castillo Viejo 242 
Chicano Mexican-American 330 
Chile A9 

Cocoliche see Argentina, also Italian 

Colombia {see also Palenquero) 74,231 

Costa Rica 67 

Creole see Creole 

Cuba 341 

Ecuador 262 

European 184,209 

Fronterizo see Portuguese: Fronteirifo 

Judeo-Spanish see Sefardi 

Judezmo see Sefardi 

Galician, Gallego see Galician 

Ladino see Sefardi, American 

Leon{ese) 50,207-8,319 

Madrid 263 

Medieval see Old 

Mexico {see also Chicano) 25 

Mozarabic 207-8 

Murcia{n) 207-8 

Nicaragua 341 

North of Spain 50,262,263 

North America 50 

OW16,56, 59,79,82,109,121,139,159, 
178-9,231,245,252,261,268,275,293, 
296,299, 309-10,313, 324 
Palenquero see Creole 
Paraguay 262,341 
Peru 67 

Philippines {see also Creole) 74,93 
Puerto Rico 67 

San Domingo (Dominican Republic) 51 
Sefardi, Sephardic, Judeo-Spanish 95,218, 
233, 342-3 
Seville252, ^\\ 

Southern Spain 208 
Standardm,3\\ 

Toledo 208 
Uruguay 339 

Sursilvan, Surselvisch see Rhaeto-Romance: 
Romansh 

Sutsilvan, Sutselvisch see Rhaeto-Romance: 

Romansh 
Swedish 239 

Tessinese, Tessin, Ticino see Italian: Swiss 

Thracian 245 

Thraco-Dacian 92,243 

Trinidad see Creole 

Turkish 96,254 

Uruguayan see Spanish: Uruguay and 
Portuguese: fronteirifo 



366 


Index of languages 


Valencian Catalan West Romance 38,107,119,171,177,181, 

Vallader Ladin see Rhaeto-Romance: Ladin 197,213,234-5,237,244 

Venetan^ee Italian: Veneto 

Venetian ree Italian: Venice Yugoslavia 340 

Walloon see French 

Welsh 105, 238 Zamboangueno see Creole: Philippines 
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ablative 81 
abstract see noun 
‘abstract’ analysis 26 
Abstand (‘stand-off’) language 206-7 
accent, accentuation 40,47,79,99,100, 
108-11,154,157,235,285-7 
placement 108 
stress 174,246 
tonal 108 

accommodation 206,218 

accusative, acc. see case 

accusative + infinitive 143-6,163 

actualization 37 

adjectival article 148 

adjective placement 36,146-7,201 

adjectives 36, 55,58, 59,60,62,67, 81,83, 

86,90,105,118,123,128,146,277,279, 
325-6, 

adnominal (‘epithet’ or ‘attributive’) 
117-18,129 
determining 146 
nominalized, substantivized 59 
predicative (i.e. post-copular) 257,277 
adstratum 232-3 

adverbs, adverbializer 74,81-3,90 
affective 146,147 

affix, affixation 81-7,130-1,174,178 
inflectional 174 
affricate, see consonant 
agent 19,179 

agglutination 22,74,75,84,123,178,200 
agreement 23, 36,48,63,64,68,135,141, 
143,148,168,201,257-61,269-71, 

286 

verb 270-4 

Albigensian crusade 190 
Alliance Israelite Universelle 343 
allophone, allophonic 241,287,295 
ambigeneous see gender 
analogy 57,134,297-301, 313,319 


analytic 37,136,156,177,185,200, 314 

anaphoric 126, 129,167 

anaptyxis 99,109 

animate 55, 58,66,69,121,262-3 

apheresis 290 

apocopation, apocope 50 

apophony 162,297-8 

Appendix Probi 12, 15, 102 

approximant 242,289 

archetype 37 

argot 331 

argument 52,129,168,257,267 
article (see also definite, indefinite, partitive) 
59,61,64,87,123,126-7,128,173, 

277 

articulation 
apical 113 
dental 113 
laminal 113 

aspect, see marker, TAM, perfect, imperfect, 
progressive 

aspectual forms 75,134,156,175-6, 339 
periphrastic 156 
assibilation 140 

assimilation 110,112,115,156,234,236, 
248,268,287,322 
regressive 27,291,293 
assonance 27,29,159,287 
atlas, linguistic 202-5 
atonic 15,29, 36,44,100 
attitudes 343-4 
attrition, phonetic 52 
augmentatives 84 

Ausbau (‘built-up’) language 206,207 
auxiliary 7,15-24,36,75,130,135-6,154 
modal 166, 
perfect 257-61 

auxiliary selection 15-24,257-8,331 
^£■16,20,180 
HAVE 20 ,257 
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babies 338 

back slang, verlan 332 
backing 288 
Balkanization 207 
Bartsch’s Law 247-8,289 
bilingualism 232,282, 335-8,341 
asymmetrical 334 
Binnendiglossie see diglossia 
bioprogramme 224 
bleaching 6,18 

borrowing 85,93,151,185,244-5,250,325, 
235,330,339,341 
bound forms 40 
Bourbon 208 

breaking (see also Bartsch’s Law) 159 
bricolage 317 

cajun 223 

Carolingian reforms 152-3,231 
case 12,115-20,122^, 127,140,154,155, 
199,230 263,327 
ablative 314 

accusative 12, 54,61, 63,119,122,124-5, 
127,129,229,259,262,286,305-6; per¬ 
sonal, prepositional (PA) 121-3 
dative 21,78,121,124-5,127,169,201, 
306; ethic 125,266 
direct 123 

genitive 123,275,278,314 
nominative 78,115-20,123-4,127,273 
oblique 115-20,123^ 
vocative 123 
case marker 275 
case vide see chain-shift 161 
causative, factitive 7,144-5,164,201,266 
ceceo 252 
cedilla 140 

centralization, vowel 140,243-4,252 
chain-shift 161,237 
drag-chain 161,239 
push-chain 161,239,240 
chtimi 333 
circumflex 140 
classification 71,196-206 
genetic 196-200 
historical / synchronic 199-200 
quantitative 199 
subgrouping 197 
typological 200 
clause 

complement, completive 144,149,163 
finite 144-5 
main, matrix 163,248 
subordinate 142-5 
clause union or matiness 263 


clefting 286 

clitic, cliticization 21, 36,52, 54,77,78,79, 
121,125,170,186,281,303,305 
adverbial 20, 169,251,261-2,267,268-9 
array, cluster 167, 266-9 
climbing 21,263-6 

doubling, copying 122,168,201,253,265, 
341 

enclisis 44,45,50,127,169,170,174,267, 
310 

neuter 169 

object 36,45,103, 121,127-8,129, 167-74 
phonological 51 

proclisis46, 52,169,174,185,265,267 
reflexive 78,264,266 
subject 46,47, 52,70,202 
syntactic 51 

cluster, consonantal 107,291-4 
medial 15,101,112,113 
word-initial 112 

coda, syllable-final 42,66,108,160,287,290 
code-switching 221,223 
codifiers, codification 58,210,214,255,267, 
283,335 
CODOFIL223 

cognate 11, 12, 85,273,311,313 
cognate object 24 
cognitive structure 6 
collective 88,275 
comparative construction 314-17 
comparativist, comparison 11, 71,97,149, 
181 

compensatory lengthening 108,160 
complement 313, 316 
complement clause, complementation 144, 
163,186,203,304,314,317 
complementizer 141,149, 307, 315 
prepositional 163 
QU- 145 163, 307, 317 
complex nucleus 161 
compound, compounding, compound 
words 15,23,81,200,310 
compound tenses (see perfect, pluperfect, 
surcompose) 132,137,154,156,201, 
257-61 
conative 265 
concrete 27,61,66 

conditional (see also tense, future in the 
past) 16,133 
conflict 330, 332-5,337 
conjugation see verb 
conjugation-class 130-1, 318 
conservatism 326-7 
consonant 62,65,69,99,110,113,197 
affricate 12,139,193,240 
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apical 294 

back 112 

continuant 242 

dental 113,139,240,251,294 

flap 294 

fricative 113,251,294,295 
front 111, 113 
interdental 295 

intervocalic 26,27,234-7,281,327 
labial 289 
lateral 326 

long (geminate) 106, 111, 112,140,199, 
234-7,290,292,293,294,326 
nasal 25-9,42,247 
obstruent 247,290,292-3 
palatal, palatalized 110,111,289 
palato-alveolar, postalveolar 12,113 
plosive, stop 113,241,281,295 
retroflex 244,326 

sibilant 113,234,251-2,281,295-6; api¬ 
cal 231,251; dental 12,294 
syllable-flnal see coda 
velar 111,112,115,239,294,295,298,326 
voiced 114,295 
voiceless (surds) 199,241 
consonantal clusters 43,111 
eontact 345 
contamination 82 
continuous see progressive 
continuum 
creole 221 

Romance 94-6,163,223,339 
control constructions 163 
convergence 155-86 
copula 179,257,312,313 
core languages 92 
corpora, language 330 
corpus planning see planning 
Council of Tours (813) 152 
count noun 63 

creole 25,35, 36, 39, 51,54,55,68-75, 80, 

90,92,156,166,175-6,185,219-26, 
340,341,342 
creolization 3, 93,96, 167 
Crusades 172,255,343 
cultismos see Latinisms 
cursus 286 
cycle, cyclical 176 
Cyrillic script 28,139,190 

Dark Ages 103,227,251 
dative see pronoun, case 
declension see noun 
decreolization 75,76 
default 291 


definite article 36,59-61,66,68,123,126-7, 
129,154,199,275-6,286, 306,318,326, 
340 

definiteness 121,168,269 
deixis see demonstrative 
delayed release 240 
delinking 161, 239 
democratization 344 
demonstrative, deictic 47, 59, 68, 126, 
279-80,286,318 
distal 125-6, 129,278-9,317 
middle distance 279 
neuter 279-80 
proximal 279 
denasalization 27,28 
dental see consonant 
depreciative 89 

derivation 58,84, 85,91,97, 101,200,254, 
309 

derived subject 24 

determiner 123,201,274-81,286,317-18 
zero, non-overt 275,276 
devoicing 247,252,295-6 
diachronic 11,27,38,70 
diacritic 139 

dialect 11,21,39,40,41,45,46,49,53, 57, 
61,62,64,68,75,78,92,101,121,184, 
190,192,194-5,202-6,210,226,241, 
283,305, 331,333,337,340 
dialectalization 227 
dialectology 226,329 
dialectometry 202-6,226 
differentiation see dissimilation 

diglossia 153,184,232,335-8, 341 
Binnendiglossie 335 
dilalia 336 
digraph 158 
dilalia see diglossia 

diminutive 37, 58,67,84,101,111,324 
diphthong 13,25-7,161-2,230, 238-9,241, 
285,288-9 
falling 158, 247 
heavy or light 159-61 
levelling, fusion, resolution 160,285-6, 
288 

nasal 25,42 
rising 158 
upgliding 158 

diphthongization 9,102,108,157-9,186, 
236,247-8,253,276,287-9,298,327 
conditioned 159 
spontaneous 158 

direct object see object dissimilation 
disjunctive 51, 52, 54,79,124-5,278 
disloeation 286 
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left46,129,168,201 
right 129 

dissimilation 134,158,248,292,322 
DOMs {Departements d’Outre-Mer) 341 
doublet 66,288,294,319 
doubling see consonant, long 
doubling, syntactic 62 
drag-chain see chain-shift 
drift 155,176 

e instable, e caduc, mute e 27,173,292 
‘Each word has its own history’ 203 
ellipsis 54,314,315,318 
enclisis see clitic 
epenthesis 28,286,291^ 

Equi-NP see control constructions 
ergative 20 
ethnicity 192 

etymology, etymon 13,44, 73, 96, 100, 108, 
118,127,244,250,263,298,305, 321 
Eurotype 200 
exaptation 119,310-19 
experiencer 19 

expletive 46, 52,168,304,314 
extrametrical 27 

factitive, causative 7,144-5 
family, language 38-9,71,96 
family tree 197 
feature splitting 242, 287 
feminine see gender 
filter 267 
finite see verb 
fission 239 
flipflop 160 
forclusif303, 305 
fortition 234, 285 
franfais avanc^ 156,220 
franfais fondamental 151 
francitan 335 

Franco-Prussian war (1870-1) 5 
Franks 245 
free form 83 

French Revolution (1789) 203,211,288 
frequency 90 
fricative see consonant 
fronterizo 339 
fronting 237-41,259,296 
functional load 161 
function class 80 
function words 9,72 
future 77,103,177-9,186 
GO-future 176 
synthetic 177-9 
stem 178-9 


Gallicism 183 

geminate see consonant, long 

gemination, doubling 234-7 

gender 14, 36, 55-69,70,83,96,119,125, 

141.169.246.253.278.286 
ambigeneous 63-6 
assignment 69 

classes 55 

feminine 14,44, 55,56, 57, 58,63,64,65, 
66, 67,68, 88,123 
fluctuating 56, 58 
function 66 

masculine 55, 56,57, 58, 59,60,61,62,64, 
65,66,69,117 18,119,127-8 
neuter 55, 56, 58, 59,60,64,65,70,83,87, 
88,101, 327; abstract 60; article 61-2, 
318; collective 61-2; neutro de materia 
60 

generative grammar 6,19,27,45 
‘classic’ 26 
generic meaning 275 
gerund(ive) 75,174,175 
genitive see case 
Gipsy 213,340 

glide, palatal 43,45,111,113,159,230, 287 

glottalization 231 

Golden Age 156,183,228 

gorgia toscana see spirantization 

Graeco-Latin tradition 140 

grammar 115, 140-1, 157, 171,208,220, 

280,284,341 

grammarians 126,128,140 
grammaticalization 6-7,18,70,76,121-2, 
135-6,258,268,273 
graphy see spelling 
grasseye (see uvular r) 294 

hardening 45,114,298 
harga’s 251 
heterosyllabic 292 
hiatus 289 
historiography 10 

homonyms, homonymy, homophony 45,47, 

66.67.286 

honorifics 48,49,53, 78 
hypercharacterization 305-19 
hypercorrection, hyperurbanism 44,117, 
265,291 

iconocity 156 
Idealism 5,6 
idiolect 189 
imperfect see tense 
imparisyllabic 117 
imperative 132, 142, 143, 173 
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imperfect endings 133-5 
imperfective 93 

impersonal {see also indeterminate agent) 
22,24,46, 53,118 
inanimate 65,69,262-3, 306 
inceptive 299 
inchoative 131 
incorporation 147 
indefinite article 59,68, 87,275,276 
indeterminate agent, impersonal 268-74 
indicative see tense 
indirect speech 133 
infectum and perfectum 132,179 
infinitive 75,130,144,163-6,175-6,186, 
201,263-5,340 

conjugated, inflected, personal 41,132, 
142,145, 165-6,209 
historical 163 
prolative 163 
substantivized 164-5,173 
inflection 7,40,45,51,52,109,115,118, 
122-4, 130 
ending 49, 50 
personal 40, 52 
verb 36 

inherent pronominal see pronominal verb 
insecurity, linguistic 332 
intensive affix 84-5 
interdental see consonant 
interlanguage 282,344 
interlects 38, 335, 339 
‘internal7‘externar relationships 115 
interrogative 143, 249, 306-10, 318 
intervocalic 26,27,94,234-7,281,294 
intransitive 19,136 
inversion 309 
complex 249,309 
interrogative 249,331 
stylistic 308 
subject 20 
invisible hand 156 

irregular 130, 133, 134,296, 300, 313 
isogloss 202,204,246 
iterative 85 

Jacobin 211 

jodization 103,111,253 
jota 251 

koine, koinai 38,202,204,206,210,217,338 
koineization 206,208, 336 

labial 115 
labio-velar 162 
laminalization 231 


language acquisition device 152 
language community 71 
language death 93,334 
language loyalty 337 
language names 104 
language nationalism 332 
language of revolt 331 
langue romane 3, 191,246 
Latinisms, learned word, cultismo 138-9, 
150-1 

Latinitas 344 
lax 295 

leismo 263,285 
lenition 234-7,285, 336 
lexicalization 20,22,28,29,161,236,288 
lexicochronology 197-8,226 
lexicology 320 

lexicon, vocabulary 9,11,35,37, 39, 70,71, 
80,87-94,96,97,100,102,106,110, 
138-9,150-1,181^, 209,220,224,244, 
250,253,280,284,319-26 
lexifier languages 35, 72, 80,90 
liaison 26 

lingua franca 181,255,344 
lingua do preto 219 
linguistic atlas see atlas 
linguistic market 330 
literacy 211,333 

literary language 36,42,43,45,94,136,184, 
190-1,215 

loan-translation (caique) 253 
loanwords, borrowings 69,83,93,150, 

182-3, 238, 241, 243, 250, 252,253, 322 
locational adverbial 126 
locational goal 124 
locative 261,323 
Loi des Trois Consonnes 293 

manner 27 

marker 27,58,66, 73,74,122, 134 
dative 121 
feminine 55, 64 
gender 66-8,126 
graphical 55 
partitive 124 

person 39,51, 53,54,72,79,202,230,286 
tense/aspect/modal {TAM) 72-6,115; 
ANTERIOR 76; CONDITIONAL 73; COMPLE¬ 
TIVE 73; FUTURE 72,73,74,76; habitu¬ 
al 73,74,76; imperfectiveIA; 
irreal{is) 74,76; iterative 74; nega¬ 
tive 14-, non-punctual 74, 76; past 12, 
76; perfective 72,74; progressive 73 
maroon 90,219 
marranos 342 





372 


Subject index 


mass-noun see noun, non-count; mass 

matiness see clause union 

matrix clause see clause, main 

mechanization 6 

medial cluster 15 

metalanguage 10,140 

metaphony, umlaut 60,159-61,253,300 

metaphor 321 

metarule 155 

metathesis 14 

metric 119 

middle 20 

minimal pair 106,180,186,243,289,295 
minority language 333, 337 
modal, modality 36,72, 75,266 
monophthong, monophthongization 161-2, 
287 

mood (see tense, indicative; imperative; 

subjunctive) 141, 143 
mora 106,108,158-9,288 
moriscos 251 

morpheme 76, 84,95,108 
structure 9 

morphologization 6,9,29,159-60 
morphology 9,27,35-6, 39,47,49,53-6, 59, 
60,70-1,90, 115-16,121,126, 129, 
131-4,136,140-1,154-5,164, 176-7, 
246,280,284,296-302 
case 140 
verb 130-2 

morphophonology 65 

morphosyntax 9,11, 35,39,69,72,248-50 

mozarabic, musta'ribun 251 

multiple causation 284 

mute e, see e instable 

mutual intelligibility 254, 339 

muwassah poems 251 

nasalization, nasality 8,9,24-9,42, 119, 

140,199,235,292-3, 327 
nasal hardening 28 
nasal occlusion 28 
nasal vowels 24 6,29; height 28 
nation-state 211-14 
nationhood 214-17,283 
negation 148-9,154, 302-5, 315 
expletive negative 149,315 
negative attraction, concord 148-9 
negative incorporation 148 
negative operator 304 5 
negative polarity 148, 302, 316 
partial 149 
reinforcer 303-5 
Neogrammarian 4,150,203 
neuter see gender 


nominal 36, 37,45, 79,96,115-17,123 
nominalization 132 
nominative, nom. see case 
nominativus pendens 201 
non-count see noun, non-count; mass 
non-standard usage 21,22, 51, 66, 68, 184, 
202-3,254,262,280,296-7, 308, 
310-11,330,333 
non-pro-drop see pro-drop 
norm, normative 2, 58,182 
normalization 215 
normativization 215, 255 
noun 36,45,46, 56,58,60,61,65, 67,68,69, 
73,77, 80,116-18,120,123,126, 128-9, 
323-5 

abstract 55, 58,61,66,126 
agentive 85 

ambigenous 63,65, 66 
collective 122 
concrete 323 
count 118 

declension; first 55,123; fourth 57; second 
55-6,119,123; third 56, 57,64,68,120 
inherited 73 
instrumental 85 

non-count, mass 60, 61,62,63,66,118, 
120 

NP (noun phrase) 129-30, 186,259 
nuclear vowel, nucleus see vowel 
complex 161 
null subject see pro-drop 
number, noun 37,126,141,229 
plural 45,48-9,56-8,61,63-7, 116-18, 
122-3,125-6,230 
numeral 59, 87-9,253 

object 58,121,124, 129 
direct 19,23,259 

object pronouns, ‘atonic’ see pronoun 

objective conjugation 168 

obligative 177 

oblique see case 

obstruent see consonant 

onomasiology 320 

onomatopoeia 323 

onset 158-9, 287,318 

optative 142 

orthography see spelling 

Ottoman 196,207 

oxy tonic 14 

palatal, palatalization 9, 110-15, 150, 229, 
231,239^1,252,291,293, 298 
panchronic 12,14-15,43 
paradigm 120,131,135,297-301 
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paradigmatic 37 
paragogic 42,290-1,300 
parameter setting see Principles and 
Parameters 
paroxytone 109,237 
participle 135 
passive 179,258 
participial adjective 179, 258 
past 16,22,62-3,75,135-6,147,201, 
257-61,318-19 
particle 128 
adnominal 129 
identifying 128-9 
invariable 53 
partitive 201,262,273-6 
prepositional 274 
pseudo-partitive 273 

passato remoto,passe historique, passe defini, 
see tense preterite 

passive, passivization 18,20,23,103,124, 
132-3,135,179-81 
a£-passive 180,272,313 
S£-passive 180-1,269-74, 313 
periphrastic 36 
past see tense 
patois 216,219 
Peregrinatio Aetheriae 103 
perfect 23,132,136,177 
active 16 

a£-perfect see auxiliary 
compound 23,62,133-7,257-61,299 
future 132 

//,tF£-perfect see auxiliary 
historic 132 
indefinite past 135 
indicative 16 
periphrastic 36 
present 132,179,299 
subjunctive 16,17 
perfective 93,299 
peripheral 8,155-6 

periphrastic 36, 72,73,76,88, 89,156,175, 
177,179, 300,308 
passive 36 
perfect 36 
progressive 72 

person, person markers 22,39-55,127 
first and second plural 47-51 
personal infinitive see infinitive 
philology see comparativist 39 
phonemes, phonemic 26, 50,68,106,162, 
173,243 

phonetics 35, 35,39,43,68,71,87,94,95, 
110,111,240,285 
phonologization 29,241 


phonology 9,11,27,35, 50,51,60,61,71, 

78,97,102,105-6,108,110, 111, 126-7, 
130,150,155,197,199,209-10,232, 
240,280,285-96,341 
generative 26,45 
phonotactic 290 
phrasal 317 
pidgin 224 

pivot, link particle 314, 316 
place (phonology) 27 
planning 
corpus 215,255 
status 215 

pleonastic 44,46,314 
plosive see consonant 
pluperfect 137,311 
plural see number 
polar questions 249,308,310 
polite form see honorific 
polynomic standard 211,216 
Pompeii 100,234 

popular 22,49,83,85,100-1,102,138,150, 
325,331,333 

possessive 59,68,77, 84,118,124,126,136, 
276-8 

adjectival 129,276-8 
reflexive 278 
strong 129,276-8 
weak 129,276-8 
potential 142 
pragmatics 8,146 
predicative 18 

predicative relative see tensed complement 
prefix 74,75,84,267,290,302 
preposition 115-16,123^, 127,144, 323 
preposition stranding 342 
prepositional accusative (PA) 121-3,168, 
200,201 

prepositional phrase (PP) 261 
prescriptive 297 
present indicative see tense 
preterite see tense 

Principles and Parameters 2,7,10,20, 51, 

70,76,167,201 
Principle of Least Effort 285 
pro-adjective 315 
proclitic see clitic 

pro-drop, null subject 51,53-4,70,201,274, 
307 

progressive see aspect 
promotion raising 20,265 
pronominal see pronoun 
pronominalization 315 
pronominal verb 20-3,36,51,59,124,129 
impersonal 181,269-74 
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inherent 20,21,22,274 
middle 23,180,269 
passive 21,23,180,269-74 
reciprocal 20,21 

reflexive 21-3,49, 125,181, 253,269-74 
pronoun (see also clitic) 45-54, 59, 116, 121, 
124-5,127-8, 173,317-18 
adverbial 259 
anaphoric 47, 167 

clitic (q.v.) 20,41,45,46,52, 53,122,125 

conjunctive 167 

copying see clitic doubling 

dative of interest 169 

definite 59 

deictic (see demonstrative) 47,126 
discourse 47,167 

disjunctive, tonic 45,57,125,127,167, 
262,265 

object 21, 36, 258, 318; atonic 36 

personal 41,51,76,79,90,122; tonic 122 

pleonastic 44 

possessive 76 

postposed vestigial 52 

post prepositional 78 

preposed 49 

reflexive 264 

subject 37,40,41,43,45,46, 51, 53, 54, 

73,79,145,170,248,262,318 
third person 125,129 
proparoxytone 108-9 
prosthetic 289-91 
proto-language 11, 14,97,199 
prototype 115 
pseudo-adjective 147 
punctual 299,311 
purism 214,330,336, 338 
push-chain see chain-shift 

quality, vowel 107 
quantifier 275 

quantity, vowel (see also vowel, long) 107,158 
QU- (WH-) conjunction, complementizer 
145,164, 309,314,317 
question marker 307-10 
questionnaire, dialect 203 

mfforzamento sintattico 236-1, 286 
raising see promotion; vowel 
reciprocal see pronominal verb 
reconstruction, reconstructionism 4,11-15 
Reconquest 208,215 
redundancy 317 

reflex 13, 56, 57,85,114,123,229,262 
reflexive (see pronominal) 186 
reform, spelling 240 


Reformation 211, 344 
regular 40,99,300 
regularist 4 

Relational Grammar 19 
relative clause, restrictive 146 
non-restrictive 146 

relative pronoun, relativizer 305-6, 317-18 

Renaissance 152,207,227, 344 

Republican 330 

restrictive see relative clause 

resultative 313 

retraction 244 

retroflex 294 

Rhaetian 192 

rhotacism 28 

rhyme, rime 25,29 

rhythmic group 110, 286 

Risorgimento 212 

Roman Empire 152, 227, 231, 245, 343 
Romany, Romali see Gipsy 
Romania contimia 38 
Romanticism 333 
roofing language 206,217-18,256 
root 130 
rounding 288 

sandhi 9,285-7 
schwa 27,173,291 
scriptorium 210 
sdrucciole 99, 100,293 
segment, segmental 25,40,108 
tier 161 

semantics 10,18,48,60,90,92,94,96,115, 
129,133,135-6,154,156,284,319-26 
patterns 55 
semi-cuhismos 150 
sentence roles 122 
servitude grammaticale 143 
seseo 252 

sex-gender marking see gender 
sibilant, see consonant 
simplification 310-19 
sinalefa 286 

singular 22, 56, 57, 61,64,65,66,67,68, 
116,118-20,123,125,128 
skeletal tier 161 
slang 340 

small nation 214-18 
social variation 100 
sociolect 228,341 

sociolinguistics 54, 221,232, 329-45 
solidarity pronoun 169 
sound-change 120,150,255, 301 
sound-law 203 
specific 275 
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spelling, orthography 26, 27,119,139,151, 
209,210,230,236,237,240,243,247, 
283,286,287 
spirant see consonant 
spirantization, gorgia toscana 235,241-3 
Stammbaum (see family tree) 196-7 
standard theory see generative 
standardization, standard 21, 23,27, 37,47, 
51,54,121-2,136,141,143,149-50, 
168,178,205,215,256,276,280, 

329-34 

regional 202,218,254,284,295-6,299, 
338 

standardology 196,206-17 
stative 19, 313 

stem (see also future) 44,45,62,120,132, 
162,297 

stem alternation 117,297-9 
stem-stressed, rhizotonic 45, 51,297-9 
Strasburg Oaths (842) 177,273 
stress (see also accentuation), 50,54,99,100, 
108, 111, 131 
antepenultimate 100 
pattern 108-9 
rhythmic group 110 

style, stylistics 8,10,133,147,310,319,329 
subject 19,20,22,45,46,62,74,83,124, 

129,145 

subjunctive 132,141-4,149,154,164,298, 
304,314,315 
future 132,142,209 
imperfect, past 132,142,310-11 
perfect 132, 142 
pluperfect 132,142 
present 142, 300 

subordination (see also clause) 186,315 
substratum 152,202,232^5,281, 320, 340 
suffix, suffixation 44-5,64,69,74,84-7,99, 
111, 133, 147,309 
bound 83 

derivational 85-7,254 
superstratum 232-3,245-54,281,320 
supine 144 

suppletion 131-2,296,313 
surcompose 137 
surds see consonants, voiceless 
surface ordering 201 
SVO, SOV see word order 
syllabicity 288 
syllable 13,108-9,119,131 
accented, stressed, tonic 101,107-8,160, 
162,239,246 

antepenultimate 41, 99,109 

atonic, unstressed 99, 107,109,162,246 

blocked, checked, closed 239 


final 109,246 

free, open 40,158,160,239 
length 248 

penultimate 40,99,107,109 
pretonic 239 

weight 107; light 108; heavy 109 
synchronic 10,26,38,96 
syncope, syncopation 15, 99,100, 102,111, 
292 

syncretism 116,141 
synonym 92,255 
syntagma 59,81,126 
syntagmatic 37 

syntax, syntactic 10,35,37,62,103,115, 
129-30,154,176-7,254-80,284,302-10 
synthetic 156,176-8,185,200-1,331 

(tense-aspect-mood) see marker 
tautosyllabic 27,108 
template 298-9, 313 
tense see lax 

tense (see also subjunctive; perfect) 37 
aorist 15,179,299 
future 177-9,185,331 
future in the past 16,177-9 
future subjunctive 327 
GO-past 300 

imi»rfect41,44,50,74,75 
indicative 39,42 
pluperfect 327 
present 39,40,42-3,45,47 
preterite, simple past, remote past, past 
definite, past historic 299-301, 313, 327 
tensed complement, predicative relative, 
attributive relative, pseudo-relative 201 
thematic vowel 44, 50,297, 300 
theme, thematic 19-20 
theta role 19 
Tigani see Gipsy 
tilde 25,140 
timbre 55 
tmesis 178 

Tobler-Mussafia Law 170 

tonic see vowel, syllable 

topic, topicalization 46,126,201,248-9, 

275,286 
transitive 21-2 
transparency 156 
Tre Corone 212 
triphthong 161 
troubadours 190-1 
truncation 26,99,286 
type, typology 2,6,70,226 
‘holistic’ 36 
‘integral’ 36 
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umlaut, metaphony 101-2,118-19,154, 
159-60 

unaccusative 19-24, 52,62,135,259 
unaccusative/ergative hypothesis 20 
unergative 20 

Universal Grammar (UG), 283 
universal 6 

Ursprache see proto-language 
uvular r 244,294-5 

Vatican Council (1962) 152,154 
velar, velarizing 12,243,294-300 
verb-first (VI), verb-second (V2) see word 
order 

verb phrase (VP) 53 

verbs, verbals 36,44,35, 36, 51, 52,53,54, 
71,73,80,83,93,128,131,133,135, 
320-2 

conjugation 41,42,47,130-2,133^, 177, 
300 

finite 135 

inflections 36,39,46, 52 
intransitive 15,16,17,18,20,83,135 
irregular 130,134 
lexical 36,134 
paradigm 60 

perception, see verba sentiendi 

pronominal 36 

regular 96, 130,177 

system 39, 115, 140 

transitive 18,23. 63 

volition 143-4 

voluntative 142 

VERBA DECLARANDI, verbs of saying 144 
VERBA SENTIENDI, verbs of perception 144, 
264,266 
verbi servili 17 
verlan see back slang 
Villers-Cotterets (Edict, 1539) 210 
Vlach, 105,207 
vocabulary see lexicon 
vocalization 292-3 
voice see passive 
voiceless stops, intervocalic 94 
voicing (see also lenition) 231,232 
volgare illustre 141 


VOS (Verb-Subject-Object), see word order 
vowel 55-6,61-2,65, 68-9,99,106-8, 111, 
113,119,197 
back 240 

central, centralization 140,243-4 
front, fronting 112,240,244,298 
high, height 28,157,243,253 
high-tone 99 
intertonic 111 
lax 157 

long, length 13-14,106-8,157,244,253 
low 28 

lowering 253,288 
low-tone 99 
mid 158,231,240 
nasal see nasalization 
non-peripheral 157 
peripheral 157 
raising 161, 288 

round, rounded, rounding 241,288 
schwa-type 99 

short99,106-7,109,111,157 
theme, stem-vowel 40,42-3,130,135 
tonic, accented 44,99,199 
unaccented, atonic 292 
vowel harmony (see also umlaut) 160 
Vulgar Latin 102-3 

Wackernagel’s Law 170 
weather verbs 23-4, 53 
WH-iceQU- 
word formation 9, 80 
word order 53,117,202 
noun-adjective 36 
surface 36,146 

SVO (subject verb-object) 36,61,146, 

202.248.271.304 

verb-first ordering, VOS (VI) 36, 52 
verb-last (SOV) ordering 103,146, 167, 

178.202.304 

verb-second order (V2) 53, 54, 248-50, 
304 

word stress 246 
zero marker 40, 45 



